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SYNOPSIS 

In this dissertation an attempt has been made to study the properties of the submodular 
functions having non zero values at the empty set. When the value of / at the empty set is 
nonnegative the greedy algorithm will yield an optimal solution to the Linear Programming 
Problem (LPP) associated with the polyhedron P{f). But the greedy algorithm may fail 
to solve the LPP when the submodular function has negative value at the empty set. We 
investigate the reasons why this happens. An algorithm to solve this class of LPP using 
minimisation of a submodular function as an oracle is given. Some example where the 
submodular functions with negative values at the empty set occur are given We show that 
the system 

{Max cx \ X e P{f), f{4>) < 0 } 

is also totally dual integral. We give connectedness properties for the submodular system 
(/, I?) when, f{(j)) < 0, and show that the base polyhedron B(f) corresponding to the 
polyhedron P(f) may be empty even when P(f) # (j). 

Following this, properties of bisubmodular functions with negative values at the empty set 
have been studied. The relationship between greedy algorithm and bisubmodularity was 
first pointed out by Kabadi and Chandresekeran. They have shown that when /(p. 4) = 0) 
then the greedy algorithm implies bisubmodularity and vice versa. In this chapter an <- 
attempt has been made to study the relationship between bisubmodularity and the greedy 
algorithm when 4) ih 0- 

The LPP associated with bisubmodular polyhedron can be solved by generalised greedy 
algorithm when the value of / at empty set is nonnegative. But when f{4,4) < 0 the 
generalised greedy algorithm may fail to produce even a feasible solution. We have shown 
that it is not easy to solve the LPP associated with such functions as it is in the case for 
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submodular functions. However, we give efficient algorithm for solving the LPP 
{Max cx\x£ 'RT-, x{A,B) < f{A,B), {A,B) C (X,F)} 

Where W = {cj j Cg, > 0}, F = (cj | Cg, < 0} and /(iji), 0) < 0. 

Some of the existing results for bisubmodular functions assumed to have zero value at the 
empty set is extended to functions having nonzero values at the empty set. 

Wh next study the Bisubmodular Systems. First we prove some properties of increas- 
ing bisubmodular functions and their associated polyhedra. We have shown that m an y 
properties of the polyhedron associated with increasing bisubmodular function and the 
polymatroid polyhedron are similar. It has been proved that the tight sets corresponding 
to the non zero components of u G P(/) are non-canceling. This property helps us to 
define support function, closoure function and the partial order for the extreme points 
corresponding to the increasing bisubmodular polyhedron. 

Min-Max theorem for the bisubmodular system is proved in a way different from the 
existing proofs. Relationship between separation problem and bisubmodular function min- 
imization is given. At the end we minimise bisubmodular functions with respect to some 
simple constraints that arise in practical problems. 

In the last chapter we develop a variant of cycle-canceling algorithm, for solving minimum 
cost network and submodular flow problems. Our cycle-canceling algorithm for the mini- 
mum cost network flow problem performs the following steps on the residual network. We 
look for augumenting cycles as well as dynamically update potentials of the nodes block- 
ing the search. The basic idea is to reduce the optimality violation of all arcs incident 
to the node blocking the search by half. The main feauture of our algorithm is that it 
simultaneously finds negative cycles and tightens optimality violations by using the basic 



graph search algorithm. This algorithm has also been extended to solve the minimum cost 
submodular flow problems. Like all the algorithms for the minimum cost submodular flow 
problem this algorithm also requires an oracle for minimising a submodular function. Since 
computational experimentation has been the primary method for comparative evolution of 
network flow algorithms, a study on computational behaviour of our algorithm has been 
carried out. We compare our algorithm with the relaxation algorithm for solving minimum 
cost network flow problem. The test problems used are the standard bench marks gener- 
ated by the publically available Netgen program. 

At the end we give concluding remarks and highlight scope for future research. 



VI 


V EVXC AT ST> TO 
My V ATZSMTS 



Vll 


ACKNOWLEDGEMENTS 

I have great pleasure in expressing my sincere gratitude to my thesis supervisor 
Prof. (Mrs.) Prabha Sharma, for excellent guidance, constructive criticism and constant 
encouragement throughout the course of this study. She has always given me free hand to 
develop my own ideas and has always helped clarify my concepts. 

I take this opportunity to thank Dr S.K Gupta, Dr Ravindra K. Ahuja and 
Dr P. Shanmugaraj for their suggestions and help in my thesis work. My sincere thanks 
are due to Drs. V.Raghavendra, S.Madan, D.kundu, Santokh Singh, B.V Rathish Kumar 
and other members of faculty who have helped and encouraged me in many ways. 

I express my sincere thanks to my senior Dr P.T Sokkalingam for the help I received 
from him in my thesis . It is a pleasure to acknowledge my colleagues Guha, Sokks, Durai, 
Sivaguru, Arumugam, Kuppi, Murthy and many others for their help and company during 
my stay in the campus. 

I thank all non teaching staff of my department for their help I also thank canteen 
Narayanan for his valuable service. 

Many friends have made my stay in IIT Kanpur a memorable one. It will be very 
difficult to forget friends like “A.P”, Ravi, Umamahesh, Raghu, Captain Ramesh, Guru, 
Kasi, Seeni, Ilango, Ponds, Kadi Siva, Durai, Rsel, Manoravi, Sokks, Tamil, Sampath, 
"Periyavar” Elango, Immi(shivde), Yogi, Arumugam, Kuppi, “Koys”, Justin, Vivek. Dhans, 
Anbu “Bucket”, Punns, Kottes, Kumar, Shankar, Johnson, Sivaguru, Periyaps, Kannan, 
Jeebu, Yettu, Suresh, Subash, Appaji, Murty, Nattu, Jeer, Venkatesh(NSI), Ramesh Rao, 
Ravichandran, Nari, Siva, Balaji, Murali, Marthandam, Muthu, G.K, Deepak, Joyadeep 
and many others who do not figure in this list. 

I also thanks Mrs Ramesh, Mrs Usha Elango, Mrs Anitha venkatesh and Mrs Sridevi 
Punniyamurthy for their hospitality. 

I am grateful to all my school and college teachers who have encouraged me throughout 



vii: 


my studies. 

I am grateful to my father-in-law and motter-in-law for their constant support and 
encouragement. 

I would like to acknowledge my parents for their affection and encouragement and for 
instilling in me a love for learning. 

Finally I offer my heartfelt thanks to my wife for her love and understanding nature 
during my research. 

S Rajesh 



Contents 


1 INTRODUCTION 1 

1.1 Literature survey 1 

1.2 Preliminaries 3 

1.3 Out line of the thesis 11 

2 On Submodular functions having non zero values at the empty set 15 

2.1 Introduction 15 

2.1.1 Linear programming problem LPP 16 

2.1.2 Algorithm for solving LPP over P{f), when f{(j>) ^0 20 

2.2 Examples 26 

2.3 Properties of submodular functions with <0 29 

3 On Bisubmodular functions having non zero values at the empty set 35 

3 1 Introduction 35 

3.1.1 Linear programming problem (LPP) 37 

3.1.2 Algorithm 42 

3.2 Examples 47 

3.3 Properties of BSM functions with (f>) < 0 49 

4 Bisubmodular Systems 55 

4.1 Introduction 55 


IX 



X 


4.2 Increasing Bisubmodular functions 56 

4.3 General bisubmodular functions 62 

4.4 Min-Max theorem for a bisubmodular system 67 

4.4.1 Separation problem 71 

4.4.2 Minimizing Bisubmodular functions subjected to some simple con- 
straints 74 


5 Minimum cost Network and Submodular flow problems 77 

5.1 Introduction 77 

5.2 Preliminaries 78 

5.3 A New Cycle cancelling Algorithm 81 

5.3.1 Current Arc Implementation 89 

5.4 Complexity Analysis .... 91 

5.5 Computational Experimentation 93 

5.5.1 Introduction 93 

5.5.2 Computational Implementation 94 

5.5.3 TEST CONDITIONS 95 

5.5.4 Computational Results 96 

5.5.5 Conclusion 97 

5.6 Submodular flows 105 

5.6.1 Minimum cost-submodular flows 106 

5.6.2 Cycle cancelling algorithm 107 

5.6.3 Optimality conditions 109 

5.6.4 Complexity 113 

5.7 Concluding Remarks 114 


APPENDIX 


122 



Chapter 1 

INTRODUCTION 

1.1 Literature survey 

Edmonds in his pioneering paper in 1970 introduced submodular functions and the sub- 
modular polyhedron. In particular the matroid polyhedron is a submodular polyhedron 
and the matroid rank function, a submodular function on the subsets of the ground set E 
of the matroid. He showed that a submodular polyhedron has the total dual integrality 
property and in case the submodular function defining the polyhedron is integer valued 
then every extreme point of the polyhedron is an integer valued vector. Edmonds also con- 
structed the greedy algorithm to solve the LPP associated with a submodular polyhedron. 
Further research focused on finding the larger class of polyhedra which have the total dual 
integrality property and also the greedy property m a generalised sense. Some of the results 
are in [37], [49], [50], [53] and [46]. 

Submodular functions have a lot of applications in practical engineering problems. See 
[35] [43] and [48] for details. The importance of submodular functions has been widely 
recognized in recent years in combinatorial optimization and other fields of combinatorial 
analysis. 

Another aspect of submodularity theory is minimization of a submodular function. In [32] 
Grotschel et.el have devised a strongly polynomial algorithm for minimising a submodular 
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function, whicli relies heavily on the ellipsoidal algorithm. Oracle for minimising a sub- 
modular function is required for solving submodular flow problems and many other related 
problems. 

The membership problem for a submodular polyhedra is, given a vector x G 7?.”, determine 
whether x G P(/) or not. It has been shown that the membership problem is equivalent 
to minimising a submodular function [42]. 

Pseudo-polynomial algorithms for the membership problem are given in [13], [14], [45] and 
[51], which can also be used for minimising a submodular function. 

Another aspect of submodularity theory which has been extensively studied is the poly- 
hedral structure of the submodular polyhedron. For details you may see [54], [25] and 
[57]. 

In the direction towards generalizing the results based on submodular functions, comes the 
theory of bisubmodular functions, and bisubmodular polyhedra. 

Welsh [56] considered a general class of polyhedra for which the greedy algorithm works. 
This class of polyhedra is exactly that of bisubmodular polyhedra determined by bisub- 
modular functions. The relationship between greedy algorithm and bisubmodularity was 
first pointed out by Kabadi [36], Chandrasekaren and Kabadi [12]. Various aspects of 
these bisubmodular polyhedra and the bisubmodular functions have been studied in [17] 
and [47]. Different approaches to bisubmodular systems were given by L. Qi [46], Fujishige 
[4] and Guha et.el [33]. 

Another major area of submodularity theory is submodular flows. Submodular flow prob- 
lems are classical generalization of network flow problems. Network flow problems with 
submodular boundary constraints, are called submodular flow problems. It includes the 
(poly)-matroid intersection problem of J.Edmonds [19] and the submodular flow problem 
of J. Edmonds and R. Giles [18]. 

Researchers have proposed many algorithms to solve the minimum cost network flow prob- 



lem. Most of the fundamental algorithms such as primal network simplex algorithms, the 
primal-dual algorithms, the successive shortest-path algorithm, the out of kilter algorithms 
and cycle-cancelling algorithms were developed in 1950’s and 1960’s. 

The development of the first polynomial algorithm by Edmonds and Karp [38] have cre- 
ated interest in developing polynomial and strongly polynomial versions of traditional 
algorithms. 

For solving the submodular flow problem the algorithms such as primal network simplex 
algorithm, primal-dual algorithm and cycle-cancelling algorithm for min-cost network flow 
problems have been extended. These algorithms were developed in 1980’s and need an 
oracle for minimising a submodular function. 

The first strongly polynomial algorithm for the submodular flow problem was given by 
Frank and Tordos [58]. This algorithm relies heavily on the ellipsoid algorithm for min- 
imising a submodular function. 

1.2 P r eliminar ies 

In this section we give basic definitions and results from submodular, bisubmodular theory, 
graph theory, complexity theory and formulations of minimum cost flow problem and its 
submodular version. 


Submodular systems 

Let E be a non empty finite set and D be the collection of all subsets of E. D is a distributive 
lattice with set union and intersection as the lattice operations, that is, for any X,Y G D 
we have XuY and X OY e D. 

Let f : D 5R./ is said to be submodular on D if 


yX,YeD: f{X) -b f{Y) > f{X UY) + f{X D Y). 
The pair (/, D) is called a submodular system. 

i ('f'' 



/■ 
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Definition 1.2.1. A family C D is called an intersecting family if for each intersecting 
X,Y & ( that is X,Y G IF and X OY ^ (jf) we have X UY,X f\Y G F. A submodular 

function f : F R on the intersecting family f is called an intersecting submodular 
function. 

A family F C 2-®' is called an crossing family if for each crossing X,Y G F 
( that IS X,Y G F and X r\Y ^ (j>,X — Y ^ f),Y — X f and X U Y E) we have 
X UY, X nY G F. A submodular function f : F U on the crossing family F is called 
a crossing submodular function. 

Let f be a submodular function defined on D. 

O' 

Define a polyhedron in^R^^ as: 

P{f) = [x\xGn^,'iX GD:Y,xi< f{X) 

I ' 

P{f) IS the submodular polyhedron associated with submodular function f. 

Let us define the base polyhedron B{f) associated with P{f) as, 

B{f) = {x\xGPif), x{E)=f{E)} 

Let V G P{f). A set B Q E IS said to he a v-tight set if v{A) = f{A). 

V G P{f) IS said to be an extreme point of P{f) if it can not be expressed as a convex 
combination of the other two points in P{f). 

Note that the value of / on the empty set need not be zero. Where ever it is so, it will be 
specifically mentioned in the thesis. 

Directed Sets 

Definition 1.2.2. Directed set: Let E be a finite set with cardinality of E=\ E \= n. A 
directed set(or diset) X = {Xi,X 2 ) of E is an ordered pair of disjoint subset of E, that is 
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XiQ E and X2 C E and Xi 0X2 = We also identify a diset of X = (Xi,X2) of E 
with a n-dimenszonal ( 0 , + 1 ) vector as follows: 

' lifeeXi 
X{e) = i -1 ife e ^2 
0 otherwise 

Let D(E) denote the collection of all directed subsets of E. We will also regard the 
elements of D(E) as collection of all n-dimensional (0 ,+l) characteristic vectors defined 
above. 

Operations on disets:. 

Suppose X={Xi,X2) and Y={Yi,Y2) are two disets. 

(i) X=-Y if = y 2 and X2 = Fi (ii) X <Y if X1CY2 and X2 2 li 

(iii) X ay =iXinYi,X2UY2) (iv) X V r = (Xi U Fi.Xs n ^ 2 ) 

(v) X C y if and only if C yi and X2 C Y2 

(vi) xnY = {Xir\Y,,X2nY2) 

' lifX(e) = y(e) = 1 
X n y(e) = 1 -1 if X(e) = Y{e) = -1 
0 otherwise 

V 

(vii) X u y = (Xi u yAX 2 u yj, X 2 u y 2 \Xi u yi) 

' lifX(e) + y(e) >0 

X U y(e) = 1 -1 if X(e) 4 - y(e) < 0 
OifX(e) + y(e) =0 
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(viii) X\F = (Xx\(ri U Y2),X2\iYi U Y 2 ) 


f 1 if X(e) = 1 and Y(e) =^0 


X\Y(e) = < —1 if X(e) = —1 and Y(e) = 0 


( 0 otherwise 


Definition 1.2.3. 

(1) Two disets X, Y are said to be intersecting if X (^Y, Y X and X f\Y ^ (f. 

( 2 ) Two disets X = {Xi,X2), Y = (Fi, Y2) are said to be non cancelling if and only if 
XiDYz and X2 D Fi = (j). Otherwise X, Y are cancelling disets. 

( 3 ) X=absolute(X)=XiU X2, So 


\X\ = \X\ = \X^UX2\ = \X^\ + \X2\ 

(4.) Let “T denote some collection of disets. T* is said to he an intersecting family if for 
any two disets X,Y G 7' and XnY ^ (f implies X UY and X n F € 7'. 

( 5 ) A diset X — (Xi,X2) ^s said to be an orthant if \Xi U A'’2| = jXj = n. 


In what follows we give some properties of disets with out their proofs. 


Let X,Y,Z be three disets of D(E) 



(i) 


(ii) 

XCFCX=^X = F. 

(hi) 

XUF=FUX 

(iv) 

X n F = F n X 

(v) 

X n F C X and X n F C F 

(vi) 

xnF cxnF 

(vii) 

xn(FnF) = (xnF)nF 

(viii) 

X U F = (f if and only 


(ix) (X u F) n F = {(X u X) n (F u Z)} n {(X n F) u (F n F)} 

(x) (X n F) u F = {(X u F) n (F u F)} u {(X n F) u (F n F)} 
The following properties are not true in general. 

{i)X CXUF or FCXUF 
{ii)iX U F) U F = X U (F U F) 



{iu)x u (Y n z) = (X uY) n (X u Z) 
{iv)X n{YuZ) = (X nY)u{Xr\Z) 


Bisubmodular Systems 

Definition 1.2.4. Let f : D(E) be a bzset function on D(E). f is said to be a 

bisubmodular (or directed submodular) function if for any X = {Xi,X2) and 
Y = (YuY^) € D(E) 

/(X1X2) + /(Fi, F2) > f{Xi u F\X2 u F2, X2 u F2 \Xi u F) + /(Xx n F, X2 n F) 

The pair (/, D{E)) is called a bisubmodular system. 

f IS said to be non- decreasing (increasing) if f{X) < f{Y) VX C F X, Y G D{E). 

Definition 1.2.5. Let f be a bisubmodular function on D{E) 

Define a polyhedron in S?"® by 

P{f) = [x\xe 5 ?^V(X,F) 6 D{E) : 

I lex leY 

P{f) is the bisubmodular polyhedron associated with bisubmodular function f. 

Let V G P{f). A set (A, B) G D{E) is said to be a v-tight set ifv(A, B) = /(. 4 , B). 

V G P{f) IS said to be a extreme point of P{f) if it is not a convex combination of two 
other points in P{f). 

In what follows is an example of a bisubmodular function. 

Let E=(l,2,3). Define / as follows: 

f{Lcj>) = f{<i>,i) = l, z = l, 2,3 

/(U, 4 >) = /(0, ij) = fii,j) = 2, i,j = 1, 2 , 3 # j 

fiv,k) = f{(l),ijk) = f(ijk) = fiijk.cp) = 3 , z, A; = 1 , 2, 3 i ^ j 7^ A: 

/((;6, (j)) — 0, then / is a bisubmodular function. 
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Definition 1.2.6. Let X C D{E). D =-{E,X) is said to be a ditroid if the family X 

satisfies the following axiom. 

iMeX 

2J if X G X and Y C X then Y G X. 

3) if X and F € X are non cancelling with |X| = \Y\ 4- 1, then there exists an e^ G E and 
Z GX such that 

Y{ef) = 0 , X(ej) = X{ei) ^ 0 and Z(ej) = Y(ej) for all j ^ i. 

The rank function of ditroid D = {E,X) is a set function h : D(E) -> 7t+, defined by 
h{X) = hiXuXf) = max {Y-.YCXandYGX]forXG D{E). 

Note: The rank function of a ditroid polyhedron is an increasing bisubmodular function. 

Definition 1.2.7. Let O C D{E). M' = {E,0) is said to be a oriented matroid, if O 
satisfies the following properties. 

(1) For all X G O, X ^ f) implies —X G O, Y C X implies X = ±Y. 

(2) For all X = (Xi, Xf) and (Fi, Yf) € O, such that X ^ — F and for all 

e G (Xi n ^ 2 ) U (X 2 n Fi) and e' G {X 1 /Y 2 ) U (X 2 /Fi), there exists Z = (Fi, F 2 ) G O, 
such that Zi C (Xi U Fi)/e, Z 2 C (X 2 U Y 2 )le' and e' G Z. 


Graph theoretical Notations and Definitions: 

A directed graph G=(V,A) consists of a set V of n nodes and a set A of m arcs whose 
elements are ordered pairs of distinct nodes. 

A directed network is a directed graph whose nodes and/or arcs have associated nu- 
merical values (typically, costs, capacities, and/or supplies and demands). We often make 
no distinction between graphs and networks. Let us use the term graphs and networks 
synonymously. A undirected graph is defined in a same way as a directed graph/network 
except that the arcs in an undirected graph are unordered pairs of distinct nodes. 
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A directed arc (ij) has two end points i and j. Node i has been referred to as the tail of 
the arc (i,j) and node j as its head. An arc (ij) is incident to nodes i and j. 

An arc (i,j) is an outgoing arc of node i and an incoming arc of node j. The forwardstar 
F(i) of node i is the set of arcs emanating from node i and the backwardstar B(i) is the 
set of arcs terminating at node i. 

Let us assume that are no loops and multi arcs. 

A walk in a directed graph G=(V,A) is a subgraph of G consisting of a sequence of nodes 
and arcs ii—ai-i<2— a2...V-i - Or-i — V satisfying the property that for all 1 < ^ < r — 1 
either ak = (zfc,2ifc+i) G A or Ck = {ik+i,ik) € A. A directed walk is a walk in which every 
two consecutive nodes ik and Zk+i are connected by the arc 
A path is a walk in which no node is visited more than once. 

A cycle is a path ii — Z2...v together with the arc (^r, ^i) or (zi, v) 

An arc (i,j) in a path/cycle is a forward arc if the path/cycle visits node i priori to visiting 
node j,and is a backward arc otherwise. 

A directed path/cycle is a path/cycle in which every arc is a forward arc. 

A graph is acyclic if it contains no directed cycle. Two nodes i and j are said to be 
connected if the graph contains at least one path from i to j. A graph is connected if every 
pair of its nodes is connected. Otherwise the graph is disconnected. 

Minimum cost Network flow problem 

Let us now present a mathematical programming formulation of the minimum cost flow 
problem (MCF). Let G=(V,A) be a directed network defined by a set V of n nodes and 
a set A of m directed arcs. Each arc (i,j) G A has an associated cost Cjj that denotes the 
cost per unit flow on that arc. We assume that flow cost varies linearly with the amount 
of flow. The amount of flow on each arc (i,j) G A is bounded below by ly and bounded 
above by Uy. With each node z G F, is associated an integer number b(i) representing its 
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supply/demand. If b(i)>0 ,node i is a supply node. If b(i)<0 then node i is a demand 
node. If b(i)=0 then it is a transshipment node. Flow on arc (i,j) e A is represented by 

The optimization model is as follows, 
min 

subject to E E Xij = b(t) Vi e V 

3 3-(3,^)^A 

I13 ^ ^ '^13'^ j'} ^ -A 


where ^ 6^ = 0. 

1 

In matrix form, the minimum cost flow problem is as follows. 

Minimize cx 
subject to Nx=b 

I < X <u, where N is a n x m matrix called the node-arc incident matrix. 

Minimum cost submodular flow problem 

Let us now present a mathematical programming formulation of the minimum cost sub- 
modular flow problem. 

Let G =(V,A) be a directed graph with node set V and arc set A. Multiple arcs are allowed 
but loops are not. 

A set B C y is said to be a (ij) set if z € 5 and j ^ B. An arc (i,j) leaves(enters) B CV, 
if B is ij-set (ij-set). Define a modular function as follows. 
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Xx(B) — ^ Xij ^ Xtj 

(ij) enters B (i,j) leaves B 

Where x G 3?^ and 5 C F 

Let c G that is c is a real weighting of the arcs. Let U G (0, oo) denote the flow upper 
bound vector. With out loss of generality lower bounds can be assumed to be zero. Let / 
be a crossing submodular function on the crossing families of subsets of V. 

The mathematical formulation of the mincost submodular flow problem is as follows 
Minimize cx 

subject to Aj;(5) < f(B), B CV 
0 <x <U 

A feasible solution x to the above problem is called a submodular flow and the set of all 
feasible solution is a submodular polyhedron. 

Shortest path problem 

The problem is to find a path of minimum cost (or length) from a specified node s to another 
specified node t, assuming that each arc (i,j) G A has an associated cost (or length) Cj^. It 
is a special case of the minimum cost flow problem. 

1.3 Out line of the thesis 

In Chapter-2 an attempt has been made to study the properties of the submodular func- 
tions having non zero values at the empty set. When the value of / at the empty set is 
nonnegative the greedy algorithm will yield an optimal solution to the Linear programming 
problem (LPP) associated with P{f). In this chapter it has been shown that the greedy 
algorithm may fail to solve the LPP when the submodular function has negative value at 
the empty set. We examine the reasons why the greedy algorithm fails. An algorithm to 
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solve the LPP when f{(i>) < 0, using minimizing the submodular function as an oracle is 
given. Some examples where submodular functions with negative values at the empty set 
occur are given . It has been shown that the system 

{max cx : X E P(f), f{4>) < 0} is also totally dual integral. Connectedness property has 
been so for defined for the submodular systems having = 0. We make an attempt to 
extend it to a general submodular systems. We also show that for a submodular function 

with f{(f)) < 0, it is possible that P{f) ^ 4> but B{f) the corresponding base polyhedron 

, 1 

is empty. Lovasz [42] gave a method for extending a set function to all of 7^ and proved 
that the extension is convex if and only if the set function is submodular. This result also 
requires that the set function must be zero at the empty set. We modify this result for 
functions with /(0) < 0. 


i 


In Chapter-3 properties of bisubmodular functions with negative values at the empty 
set have been studied. The relationship between greedy algorithm and bisubmodularity 
was first pointed out by Kabadi and Chandresekeran [12]. They have shown that when 
/(^, = 0, the greedy algorithm implies bisubmodularity and vice versa. In this chapter 

an attempt has been made to study the relationship between bisubmodularity and the 
greedy algorithm when f{<t),4>) ^ 0. The question we ask is: Will bisubmodularity 
validity of the greedy algorithm? The answer is ’’not always”. 

The LPP associated with bisubmodular polyhedron can be solved by the generalised greedy 
algorithm when the value of / at empty set is nonnegative. But when /(^, (/>) < 0 the gen- 
eralised greedy algorithm may fail to produce even a feasible solution. The LPP associated 
with a bisubmodular function having negative value at the empty set has been studied in 
detail. We have shown that it is not easy to solve the LPP associated with such functions. 
However, we give an efficient algorithm for solving the LPP, 

[max cx: xe x(A, B) < f(A, B), (A, B) C (X, Y)} 
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where X = {e-i I Cg, > 0}, F = {ej 1 Cg, < 0} and /(^, 0) <•{). 

Some of the existing theory for bisubmodular functions assumed to have zero value on 
empty sets is extended to functions having nonzero values at the empty set. 

In Chapter-4 we study the bisubmodular systems. 

First we prove some properties of increasing bisubmodular functions and their associated 
polyhedra. We then extend these concepts to a general bisubmodular function and its 
associated polyhedron. We have shown that many properties of the polyhedron asso- 
ciated with increasing bisubmodular function and polymatroid are similar. 

It has been shown that the tight sets corresponding to the non zero components of n G P{f) 
are non-cancelling. This property helps us to define support function, closure function and 
the partial order for the extreme points, when / is increasing. 

We prove the Min-Max theorem for the bisubmodular system, and our proof differs from 
the existing ones. Relationship between separation problem and bisubmodular function 
minimization is also given. At the end we minimise bisubmodular functions with respect 
to some simple constraints, that occur in practical problems. 

In Chapter-5 we develop a variant of cycle-cancelling algorithm for solving minimum cost 
network and submodular flow problems. The cycle-cancelling algorithm is credited to 
Klein [39]. Polynomial and strongly polynomial variants of this algorithm was developed 
by Goldberg and Tarjan [30], Barhona and Tardos [6] and Wallacher and Zimmerman [55]. 
Our cycle-cancelling algorithm for the minimum cost network flow performs the following 
steps on the residual network. We look for augmenting cycles as well as dynamically update 
potentials of the nodes blocking the search. The basic idea is to reduce the optimality vio- 
lation of all arcs incident to the node blocking the search by half. The important feature of 
our algorithm is that it simultaneously finds negative cycles and tightens optimality viola- 
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tions by using the basic graph search algorithm. The complexity of our algorithm matches 
the complexity of Sokkalingam [52]. This algorithm has also been extended to solve the 
submodular flow problems. A study on computational behaviour of our algorithm has also 
been carried out in this chapter. Computational experimentation has been the primary 
method for comparative evaluation of network flow algorithms. We compare our algorithm 
with the relaxation algorithm for solving minimum cost network flow problem by Bertsekas 
and Tseng [8]. The test problems used are the standard bench marks generated by the 
pubhcally available netgen program [40]. 

At the end we give concluding remarks, and highlight scope for future research. 



Chapter 2 


On Submodular functions having non 
zero values at the empty set 

2.1 Introduction 

Submodular functions with non-zero values on empty set occur while determining parti- 
tions of a submodular function / — A where A varies from — oo to +00 Such partitions are 
required while analysing decomposition of electrical networks [44]. Such functions also oc- 
cur when submodular functions defined on intersecting and crossing families are extended 
to the family of all subsets. Also parametric analysis of capacity constraints in submodular 
flow problems gives rise to submodular functions having non-zero values on empty set. 

We show that for submodular functions with negative values on empty set the greedy algo- 
rithm may fail to solve the associated LPP. Also the nestedness property of the associated 
polyhedron may fail to hold. We give a efficient algorithm for solving an LPP over the 
polyhedra defined by such submodular functions using an algorithm for minimizing a sub- 
modular function as a oracle. We also study the total dual integrality, Lovasz extension 
and connectedness of such functions. 

Definition 2.1.1. Let E=(ei,e 2 ...en) be a finite set and let f be a submodular junction 
defined on D, where D is the collection of all subsets of E. 

Letp{f) be the submodular polyhedron associated with f. 
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Through out this chapter we will assume that an oracle is available which returns the value 
of f(A) for any set A C E. 

2.1.1 Linear programming problem LPP 

For a submodular polyhedron P{f) linear programming problem is defined as follows: 


(P) Maximum ^ 

1=1 

subject to X E P{f) (2-1-1) 

where c € is a weight vector corresponding to the elements of E. 

The dual of problem (2.1.1) is 


(D) Minimum E f{A)YA 
subject to 

e.€52^ e P, 1 a > 0, VA € P (2.1.2) 

If f[(p) = 0, a greedy algorithm as in [19] for solving ( 2 . 1 . 1 ) is as follows. 

Order the elements of E according to Ci > C 2 ... > c^- The vector x G 'RP solves (2.1.1) if 
x^ = /(ei, e 2 ...ei) - /(ei, 62 , V z = 1, ...n. 

The greedy solution vector x also represents an extreme point of the polyhedron P(/). 

In case /(0) ^ 0, it is not necessary that the greedy algorithm will solve (2.1.1). 

In fact, the case, f{4>) G (0, 00 ) creates no problem for the greedy algorithm, but if /(o) G 
(— 00 , 0) then the greedy algorithm may not even obtain a feasible solution. 

Example 1. Consider the following example. 

Let E=(l,2,3) and f be a submodular function defined on subsets of E. Consider the linear 
programming problem defined on P{f) as follows: 



17 


Max X1+X2 + XZ (2.1.3) 


Xi 

< 

3 

Xi -{■ X2 

< 

6 

X2 

< 

2 

2:2 +2:3 

< 

8 

X3 

< 

4 

Xi -|- 2:3 

< 

5 

m 

= 

-2 

Xi -hX2 + X3 

< 

8 


The greedy algorithm will produce the point (5,3,2) as the solution to (2.1.3), but we see 
that it is not a feasible solution as it violates the constraints xi + 5:2 < 6 ,a;i + 3:3 < 5 and 

X1+X2+XZ < 8 . 

It is easy to verify that the point (5,3,2) is an optimal solution to the problem with / 
replaced by /, where f{A) = f{A) — f{(t>) VA C E 

But one still does not obtain the optimal solution (3,2,2) for the problem (2.1.3), since the 
two polyhedra P{f) and P{f) are totally different. 

In what follows, we investigate the reasons for which greedy algorithm does not work for 
the case when f{(f)) < 0 

Definition 2.1.2. Let P{f) be a submodular polyhedron and x 6 P{f), then a set A is 
said to be tight with respect to x or x-tight if x{A) = f{A) 

Fact: Let / be a submodular function and /(0) =0 Then AuB and ADB are x-tight 
whenever A,B are. 

Lemma 2.1.1. Let f {<())< 0 and x €. P(f). Let A,B be two x-tight sets. AuB andAOB 
are x-tight if Ar\ B ^ ft. If AD B = <f then the union and intersection of tight sets need 
not be tight. 
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Proof. Since x G P{f) and A, B are x-tight x{A) = f{A) and x(B) = f{B). 

=> x{A {J B) + x{A n S) = ^(A) + x{B) = f(A) + f{B) (2-1.4) 

But, 

f(A) + f{B) > f{A UB) + f{A n B) (2.1.5) 

If AnB = (f) and f{(p) < 0 , it is possible that 
f{A) + f{B)<f{AuB) 

xiA UB)= x(A) + xiB) = /(A) + f{B) < /{A U B) and 
x(A n B) > Thus A U B and (A n B) are not x-tight sets. 

If A n B / (j) then it follows from (2.1.4)and (2.1.5) that A U B and A n B are both 
x-tight. □ 

Note; In case f{(j)) > 0, it follows from the above lemma, that (A n B) and (A U B) will 
both be x-tight sets. 

Definition 2.1.3. A B is said to be nested if its columns can be permuted to 

obtain a matrix B that satisfies the condition 

By = 1 implies Bik = 1 Vz, j and \/k < j. 

Definition 2.1.4. Linear programming problem (2.1.1) is said to have nestedness property 
if for any c G TZP such that the dual (2.1.2) has an optimal solution, there exist an 
optimal dual solution y* such that the row submatrix of the coefficient matrix of (2.1.2) 
corresponding to the positive components of y* is nested. 

Definition 2.1.5. Given a polyhedral set B = {x • Ax < b}, P C P is a face of P if 

there exists a vector c such that 

max{c^x : x G P} is attained precisely over P. 
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Theorem (2.1.1) below relates nestedness property of the LPP (2.1.1) with 
submodularity of the right hand side function /. 

Theorem 2.1.1. (Kabadi,R.chandrasekaren [37j) 

For the LPP (2.1.1), when = 0, the following statements are equivalent. 

(1) (2.1.1) has the nestedness property. 

(2) Greedy procedure yields an optimal solution to (2 1.1) \f c >0. 

(3) The function f is submodular. 

Lemma 2.1.2. Theorem (2.1.1) does not hold when f{<i>) < 0. 

Proof. We reproduce the proof from [37] to show that even when f{4)) < 0, (1) (2) and 

(2) (3). 

(1) (2) If necessary in the LPP(2.1.1) perturb the elements of c so that all elements are 
distinct. Let P be a minimal face of the polyhedron P(/) over which the maximum value 
of the objective function (2.1.1) is attained. Let D denote the nested active matrix corre- 
sponding to the set of constraints defining P. From duality theory of linear progr ammin g 
we know that there exists y > 0 such that y is optimal for the dual and D^y = 0. Since 
D is nested it follows that Ci > C 2 > ..Cn, and the greedy algorithm produces an optimal 
solution for (2.1.1). 

(2) ==» (3) For any A, B C E, define a vector c as 

(2 if e^ €: (.4 n B) , 
c^ = < 1 if e^ €. (AU B) — {An B), 

[ 0 otherwise 

Let X* denotes a greedy solution to (2.1.1). By definition of the greedy procedure 
a:*(A n P) = f{A n B), if (A n P) 7 ^ 

=0 otherwise and 


x*{A{JB) = f{AuB). 
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It thus follows that 

f{A) + fiB) > x*iAU B) + x^iAn B) 

= f (A U B) + f (A n B) if (.4 0 B) ^ d) 

= f(A u B) + 0, if n B) = <^ 

Hence f{A) + f(B) > f{AUB)+f(AnB), that is / is a submodular function. 

It has already been shown through example (1) that submodularity of / does not guarantee 
that greedy procedure yield an optimal solution and that corresponding LPP have the 
nestedness property. So (3) (2) and (3) ^ (1) □ 

Consider the dual of the LPP in example (1) 
mm { 3 yi + 2y2 + Ayz + 6^4 + 82/5 + Sye + Sy?) 

Subject to 

yi + y4 + ye + y? = 1 
yi + y4 + ys + y? = 1 
yi + ye + ye + y7 = 1 

yt > 0 

The optimal solution is 

y* = (0. 1, 0, 0, 0, 1, 0) and the coefficient matrix corresponding to y* is 

/0000010\ 

0 1 0 0 0 0 0 

Voooooloy 

which is not a nested matrix. 

2.1.2 Algorithm for solving LPP over P(/), when /(0) ^ 0 

For the LPP defined in (2.1.1), the greedy algorithm will also obtain an optimal solution 
in case /(^) > 0, since one can always treat the value of / at empty set as zero. However 
it is not possible to do so when /(0) < 0. 
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For solving the problem (2.1.1) with we assume that an oracle for minimizing a 

submodular function is available with complexity (k). we also need the following results. 
With out loss of generality let as assume that the value of / at all singletons are zero. If 
the given function / does not satisfy this property then convert / into another function / 
so that it satisfies the above property. One can do this by subtracting a modular function 
w, where w{e) = /(e) for every e £ E. 

That is for any ACE, let f{A) = f(A) - 

Lemma 2.1.3. Solving the LPP over P{f) is equivalent to solving it over P{f)- 

Proof. This is just a translation of P(/) by a vector w={/(ei), /(e 2 ), .../(e^)} and hence 
from the definition of / it follows that if a; € P(/), then x + w£ P{f)- 
^ P{f) = P{f) - {tu}, 

Thus if {xi,X 2 ...Xn) is an optimal solution for an LPP over P{f), then {xi + w{ei),X 2 + 
w{e 2 ), ...Xn + w{en)} will be an optimal solution for the same LPP over P{f). □ 

From now on we will refer to LPP (2.1.1) with the function / having the propert}' that it 
has zero values on singletons. 

Lemma 2.1.4. To find a non — empty minimising set of a submodular function f, the 
oracle for minimising f may be used almost [Pj times. 

Proof. Apply the oracle for minimising / and if the empty set is the minimising set then, 
let E' = E- {i} 

Define a function f^ on E' as follows 
/(A) = f{A iJi)'i ACE' 
ft is submodular. 

Let At be the minimising set for fi and let 

fk{Ak) = ™“(/i(At)). Then Ak is the non-empty minimising set for /. □ 
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Let us first solve a simpler case of the LPP (2.1.1) where c’s are all 1. Then by using this 
algorithm we can solve (2.1.1) for any c > 0 We use the approach in [44] to solve problem 
(2.1.1), when all c^’s =1. Intialy find the set A that minimizes /. If f{A) > 0, the right 
hand side vector in (2.1.1) is non-negative and the constraints < 0 for singletons 

n 

=> max ^^1 = 0 

l=l 

and hence rcj = 0 Vz is the optimal solution. 

If f{A) < 0, since f{B) > f{A) MB C A, it follows that maxY^Xi = f{A). We now 
treat the set A as a singleton and repeat the above piocess for the set E' = {E/A) U {A}, 
that is replace / by f on E' such that it has zero values on singletons, and again find 
the non-empty minimising set with respect to E'. Repeat this process until E = (j) ox the 
minimum value of the current submodular function is non-negative. 

We now give a concise version of our algorithm. 

The following procedure will find a non empty minimising set A with respect to some 
subset BCE, for a submodular function / such that f{A) < 0. 

Procedure {Min — Neg{f, E)) 

Begin 

set {B = <f>) 

Step 1: li e E E/B then B = BUe 

else go to Step 3 
Step 2: f{A) = min^cxcB f{X) 

If / {A) < 0 then return A 
else go to Step 1. 

Step 3: Return {(/)) 


end 
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In Step-2 one has to use the oracle for minimising a submodular function together with 
lemma (2.1.4). 

Note 1. If the procedure Min-Neg returns 4> at the first iteration itself then Max Y^Xi = 0, 
hence Xi = X 2 = ■■■ = = 0. 

Main Algorithm 
i=0; 

Begin 

Step 1: i=i4-l; 

A=CALL (Min-Neg(/,.e)) 

If (A^ = fi) then go to Step 3 
else go to Step 2 

Step 2: Set E' = ({A} U ^/(A)) 

(Treat the set A as a singleton in E'. So | £■' |=| E/Ai | -1-1. 

Now define a submodular function f on E' such that fiAf) = 0. 
that is consider a modular function w on E' such that 
^([^D = fW and w{e) = /(e) Ve € E/Afij 
Now f = f — w, f is submodular and /'(A) = 0 
set f = f and E = E' 
go to Step 1 
Step 3; STOP 
end. 

This algorithm terminates only when the procedure Min — Neg(f, E) returns the empty 
set. 


At the end of the algorithm we will have 

E' = ([A,i[A2i-[AME/Ul.,A:i), 
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2 :e = 0 Ve G {Ej uf_i A^) and 

x^ = f{Ai), ... ^ X, = f{Ak) 

Ci^Ai CiGAk 

So 

k 

Max E^. = E/(^)+o- 

SxGE 1=1 

which is the optimum value. 

^ The validity of the algorithm follows easily. We first show that the vector x obtained at 
the end of the algorithm is in P(f'), where the ground set 


Let A be any subset of E'. The algorithm terminates when, for every subset of E', the 
corresponding value of f' > 0. Also for the singletons in E', the corresponding 
f value is 0. Therefore Vt, Xj < 0. 

Thus < 0 < X e P{f') 

If Wi,...,Wk are the k-modular functions introduced during the course of the algorithm, 
then from lemma-2.1.3, it is clear that x + wi-\- ... +Wk € P{f). 

As for the outcome x being an optimal solution, suppose there exist a x* 6 P{f') such that 

max E < > E 

CiGE e-iGE 

Since the value of /' at singletons are zero it follows that 


E 


x' < 0 


e,e£;/ujLjA. 


therefore for some Aj 

E < > E = fiA) = 0 

CxGA^ e, GAj 

that is x*(Aj) > 0 which is a contradiction. Therefore the algorithm obtains an optimal 
solution. We can now solve the LPP (2.1.1) for any c > 0. 
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Using the above algorithm, replace / in ( 2 . 1 . 1 ) by / where 

f(A) = [Max xe P(/) ) C E. (2.1.6) 

V tea / 

/ is submodular and its value at the empty set is zero. 

Lemma 2.1.5. P{f) = P{J) 

Proof. Let x 6 P{f) we show that x G P{f) 

X G P{f) 

^ x{A) < f{A)\/A e 2 ^ ,But 

f{A) = Max Yxz-xe P{f) 

tea 

So 

Max ^ ^ aij = x(A) 

tea tea 

^ x(A) < f(A) 

=^TeP(f), so P(/)CP(/) 

The converse part is of course obvious. 

So P(/) = P(/). □ 

We can now apply the greedy algorithm to solve ( 2 . 1 . 1 ) over the polyhedron P(/). 

Note that after ordering the cfs as ci > C 2 > ... > Cn, we need to calculate / for the n sets 
{(ei), (ei, e 2 )...(ei, 62 , ...e„)} only. 

Complexity 

Complexity of finding a non-empty minimising set requires the oracle for finding a minimum 
set atmost jPl times. To find [MaxY.x^ :x G P(/)) we saw that the procedure min- 
neg(/,E) has to be used atmost n-times. Thus the complexity of solving problem ( 2 . 1 . 1 ) 
when all Cj’s =1 is o(n^k). Finally to solve ( 2 . 1 . 1 ) for arbitrary c > 0, we use the greedy 



26 


algorithm. This would require calculation of / for n sets, thus the total complexity of our 
algorithm for solving LPP (2.1.1) is o{n^k). 

2.2 Examples 

In this section we discuss some problems where submodular functions with f{(j)) < 0 occur. 

Extension of submodular functions 

In [42] Lovasz considers the extension of submodular functions defined on intersecting and 
crossing families to the family of all the subsets. We show that such extensions will give 
rise to submodular functions with negative values on empty set 
Let he a, intersecting family with <f) 

Let Si,S 2 ---Sk be the maximal members of the intersecting family T 

Now for VX ^ E ,X ^ (p, contains a unique smallest subset containing X. Let it be 

denoted by X. 

Define 

( — c X n Si = (j) 

Where X fl is the unique smallest set in JF that contains (X” D Si). Now the extension 
of / is defined as follows 

f{X) = j^fi{X) +{k-l)c VXCF 
2=1 

where c is a large number. 

Now f{p)=-kc +kc -c= -c < 0. 

Extensions of submodular functions defined on crossing families will also lead to submod- 
ular functions having negative values on empty set. 
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Parameterization of submodular flow r.onstraints 

We will define the submodular flow problem as follows: 

Given a digraph G(V,E) with node set V and arc set E 

xoin'Y^c.jXij (2.2.1) 

subject to 
X,{A)<f{A) ACV 

where 0<x<g and (A) = ZjeA - Ez 6.4 

/ is a crossing submodular function on a crossing family F of subsets of V and g 6 (i?Uoo)'®. 
A feasible solution to (2.2.1) is called a submodular flow. Here f{(f)) = 0. 

The following is a well known result in [27]. 

Theorem 2.2.1. If f is a submodular function on a crossing family F with (j),V € F and 
f{4>) — 0 the corresponding polyhedron Pif) is 

P{f) = (x/x € VX 6 F xiX) < f(X)) 

If P{f) ^ (j) then there exits a unique submodular function f such that P{f) = P{f'), und 
f is defined over all subsets of V. Also f'{4>) = 0. More over if f is integer valued so is 

f- 

Thus m problem (2.2.1) we can replace / by a fully submodular function f. Parametri- 
sation of the flow constraints will give rise to submodular functions f' — X, A G (— oo, oo). 
These functions will have non-zero values on the empty set. 

LPP and Partition problems 

Let / be a submodular function with /(^) = 0. Now consider the function f = f — X, 
where A G (— oo, oo) U 3?. Clearly / is a submodular function and f{(f>) = /(^) — A = —A. 
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First consider the case when A € (—00, 0 ], the LPP 

Max cx : X e P{f)^ ( 2 . 2 . 2 ) 

can be solved by the greedy procedure. 

Lemma 2.2.1. Suppose (xi, X2, ■■■Xn) is the greedy solution for problem ( 2 . 2 . 2 ) for A = 0 
then (xi — X,X2...Xn) is an optimal solution for all A G (— oo, 0 ]. 

Proof. Define g(A) = f{A) ycf) ^ A C E and g{(j)) = 0 . 

It can be immediately verified that P{g) = P{f). 

We now apply the greedy algorithm to (2.2.2) with / replaced by g, and let x' denote the 
outcome of the greedy algorithm, then 
xi = g{l) - g{ 4 >) = /(I) - A- 0 = Xi-A 
xi = 5(1, 2) - ^(1) = (/(1, 2) - A) - (/(I) - A) = X2 

< = ^( 1 , 2...n) - ^(1, 2...n - 1 ) = (/( 1 , 2 ...n) - A) - (/(1, 2 ...n - 1 ) - A) = Xn □ 

The case A G ( 0 , 00), has already been discussed earlier and we have shown that the point 
(xi — A,X2...,Xn) may not be a feasible solution for (2.2.2) in this case. 

Partition problem 

In electrical networks analysis by decomposition [ 44 ] we come across the problem of finding 
a partition tt of E, that minimises 

= E 7^W) 

That is finding the 

mjn (/ - A(7r)) 
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where / is a submodular function on the subsets of E with /(0) =0 and tt is a partition 
of E and M’s are the components of the partition tt and A takes values between 
—CO and oo. 

consider the dual of problem (2.2.2) with Cj = 1 Vz 
minY^y{A)(f{A) - A) 

A 

subject to yi^) = 1 2 = 1) •-22- 

e,€A 

The optimal solution to the dual will give the minimum partition of E with respect to the 
submodular function (/ — A) and solutions to the dual as A varies from (— oo, co), will give 
the principle partition lattice [44] of the submodular function / 

Note 2. (a) For A G (— oo,0) f{(p) > 0. In the earlier section we have shown that the 

LPP (2.2.2) can be solved by the greedy algorithm for all f for which f{d)) > 0. Thus for 
the case when all ’s=l from linear programming duality the optimal solution to the dual 
problem will be y(E)=l and all other y^ = 0, C E. That is, for all A 6 (— oo, 0), E itself 
IS the minimum partition for / — A. 

(b) For X E (0, oo),/(^) < 0 and the primal problem corresponding to the partition problem 
can be solved by the algorithm discussed in the previous section. During the course of this 
algorithm we also obtain the minimum partition of f — X. 

2.3 Properties of submodular functions with f{(l)) < 0 

Base Polyhedron 

Let P{f) = (x : x{A) < f{A) V .4 G 2^^ be the polyhedron associated with /. 

Let us define the base polyhedron as 

B{f) = (xEn^ :xE P{f) and x{E) = /(E)) 

If /(0) = 0 then the base polyhedroiiE(/) will be nonempty wheneverP(/) is nonempty. 
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But this is not true in the case when f{4>) <0 It is possible that the base polyhedron is 
empty even though P{f) in non empty 

Suppose there exists two sets A,B such that = and A\J B = E and 

f{A) + f{B)<f{AuB)=f{E) 

Then 'ix e P{f) 

x{A) + x[B) < f(A) + f{B) < f(E) 

this implies that 

^ < f{E) Vx e P(f) 

zeAUB 

Thus B(f) may be empty, even if P(f) is non-empty. 

That is one cannot always define the base polyhedron for a submodular function 
with /((/!>) <0 

Definition 2.3.1. Let f be a submodular function on the subsets of E with f{<i)) — 0. Now 
define a function f* on the subset system {E/X,X C E) as follows: 
nE/X) = f{E)-f(X). 

Clearly f* is supermodular and f*{<p) = 0. 

In [27] it has been shown that B{f) = B{f*) when ever f{(j)) = 0 
Consider the above statement with /(0) < 0. In this case the statement need not be true. 
Since /(^) < 0, B{f) may be empty. But since /*(^) = 0 for any submodular function /, 

B{n ^ 4>. 

Hence B{f) may not be the same as B{f*) when f{(j)) < 0 

Submodular Function Minimisation 


An oracle for minimising a submodular function is an integral part of most of the combi- 
natorial optimization problems defined on a submodular polyhedron. 
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In case f{(p) < 0, we can apply the existing algorithms for minimising a submodular func- 
tion for which /(^) = 0, to the function / — /(0), since the minimising set for / — f{4>) is 
also the minimising set for /. 

The Min-Max theorem for / submodular, when f{(j}) =0 is as follows: 

max(y(£') : y e P(/), y < 0) = min (/(X) : X € 2^) (2.3.1) 

For proving this result the property that union and intersection of tight sets are also tight 
is used. Since this property does not always hold when /(0) < 0, the above Min-Max 
theorem may not always be valid for the case f{(j)) < 0. 

Replace / by the function f = f - /(0) m the equation (2.3.1) and the corresponding 
mm-max theorem is 

max (y{E) : y e P{f), y < O) = min [f{X) : X € 2^) 

Suppose minimum on the right-hand side occurs for a set ^4 C E, as mentioned earlier A 
will also be the minimising set for /. 

Total Dual Integrality 

Theorem 2.3.1. The system (Max X) : x G P{f), <0) is totally dual Integral. 

Proof. Define a function / as follows: 
nA) = (MaxEx^:xePif)) 

By the earlier lemma one can see that P{f) = P{f) and the system 

{Max X) c^x^ : x G P{f)) is totally dual Integral. So the given system is totally dual 

integral. Thus the TDI property is not affected when /(^) <0. □ 

Theorem 2.3.2. Let f,g be any two integer valued submodular function with /(<?!>) 
and y(0) < 0. Let P{f) and P{g) be the corresponding submodular polyhedron then 
P{f)nP{g) is TDI. 
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Proof. Let f{A) = max{X; a;(.4) : x e P{f)} 
g{A) = max{Z x(A) ■ x 6 P{g)} 

Now P(/) = P(/) and P(g) = P{g) 

So P(/) n P{g) = P(/) n P(p) 

Now f{4>) — g{(f)) = 0, so P(/) n P(^) is TDI by [42] and hence P(/) n P{g) is TDL □ 

Connectedness 

In this section we investigate the connectedness for a submodular system (/, D) 
when f{(j)) < 0 . 

Definition 2.3.2. Let (/, D) be any submodular system on E with f{<p) = 0 

We say that the submodular system is connected if there is no nonempty proper subset X 

of E such that 

f(X) + f{E - X) = f{E) 

Definition 2.3.3. Let B{f) be the base polyhedron, then for any x € P(/), 
let OF = {X I Xis x-tight set} . Define A^ = {nA' \ e G X, X G P"} 

Define a graph G(x)=(E,A(x)), where E is the vertex set, and A[x) = {(e, e') j e' G A%}. 
If {f,D) is connected then the graph G(x) is connected for any x G B[f). 

We now examine connectedness of the submodular system (/, D) when f{(j)) < 0. 
Definition ( 2 . 3 . 2 ) is no longer valid in this case. 

For example consider the following submodular system (/, D), where 
P = (ei, 62 , 63 ), f{ef) = 2, i=lto 3, /(ci, 62 ) = 6 , /(e 2 , ef) - 8 , /(ei, 63 ) = 5, 

/(ei, 62 , 63 ) = 6 and f{f)) = -4. 

No proper subset X of (ei, 62 , 63 ) satisfies the condition 

f{X) + f{E — AT) = f{E). Thus this system (/, D) is connected. 

But the graph G( 2 , 2 , 2 ) is not connected as shown below. 
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X = (2, 2, 2) G B{f) and = {(ci), ( 62 ), (es), (d, 62 , 63 )} 

.4^1 = {(d)}, = {( 62 )} and = {( 63 )} 

^ A{x) = {4>), and hence G(x) is a disconnected graph. 

So. the definition (2.3.2) may not hold for a submoduiar system with f{(p) < 0. 

We will define the connectedness as follows: 

Let (/, D) be a given submoduiar system with /(<p) < 0 Define a submoduiar function / 
such that 

J{A) = {max x{A) : x e P{f)) 

Definition 2.3.4. Let (/, D) be any submoduiar system on E with f{4>) < 0 We say that 
the submoduiar system (/, D) is connected if there is no nonempty proper subset X of E 
such that 

f{X) + f{E -X)= f{E) 

In the theorem below we give necessary and sufficient conditions for a submoduiar 
system to be connected in terms of the corresponding base polyhedron. 

Theorem 2.3.3. Let {f,D) be any submoduiar system with f{(t>) < 0 and B{f) be the 
corresponding base polyhedron of f. The submoduiar system (/, D) is connected if and only 
if there exists a base x G B{f) such that x{X) < f{X), for all X £ D with E ^ X. 

Proof. Let x G B{f) such that x{X) < f{X) VX £ D,X ^ E, then 

f{E) = x{E) = x{X) + x{E - X) < f{X) + f{E - X) for all X C E and X^E. 

=> (/, D) is connected. 

Conversely let {f,D) be connected. To prove that there exist a base x E B{f) such that 
x(.Y) < f{X) VX eD XAP 

Suppose not, then for any x E B{f) there exits a set Y E D such that 
x{Y) = f{Y) and x{E -Y)= f{E - Y), Y^E,6. 
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( If x{E — Y)< f(E — ), then one can decrease the values of x,'s € Y and make 

a:(Y) < f{Y) ) 

Now, fiE) = x{E) = x{Y) + x{E -Y)= J{Y) + f{E - Y). 

this contradicts our assumption that (/, D) is connected. Therefore there exists a base 
X e B{f) such that x(X) < f(X), for any X e D with E ^X. □ 

Thus one can define the connectedness for any general submodular system {f,D). 

The Lovasz Extensions of submodular functions 

Lovasz extends a submodular function / to 7?." by defining 

/'(c) = max {cx \ x ^ P{f) V c G 

Since / is submodular and Xt denotes the characteristic vector of the subset T of E, then 
by the greedy algorithm when /(/) = 0 , /'(xr) = /(Y'). 

When /(/) < 0 /'(xt) ^ f(T) and in some cases the inequality may be strict. 

From the thoery of linear programming one knows that /'(c) is convex, where / is any 
orbitary set function Vc G 'RP and hence Vc G 72." . 

But convexity of /' need not imply submodularity of / since / is not the restriction of /' 
to ( 0 , 1 ) vectors. 

Restriction of /' to ( 0 , 1 ) vectors is however a submodular function. 

We now try to answ^er the reverse question, that is when is the restriction of a convex 
function to (0,1) vectors a submodular function? Lovasz raised this problem in [42]. 
Consider the convex function /(xi, X 2 , X 3 ) = xf + x^ + xf + X 1 X 2 + X 1 X 3 + X 2 X 3 on R} 

It can be immediately verified that restriction of / to (0,1) vectors does not yield a sub- 
modular function. Consider the polyhedron P{f) obtained by restricting / to (0,1) vectors. 

If for any X C E and a convex function /, f{xx) = tnax {xxiv) ■ y € P[f)) 
then / restricted to ( 0 , 1 ) vectors will be a submodular function. 



Chapter 3 

On Bisubmodular functions having 
non zero values at the empty set 

3.1 Introduction 

Bisubmodular functions with non-zero values at the empty set occur while parameterising 
the right-hand side vector of the LPP over a bisubmodular polyhedron. Such functions 
also occur when bisubmodular function defined on intersecting families are extended to 
the family of all subsets. Also parametric analysis of the capacity constraints in the bisub- 
modular flow problems give rise to bisubmodular functions having non zero values at the 
empty set. 

We show that for bisubmodular functions with negative values at the empty set the 
greedy algorithm fails to solve the associated LPP. Also the nestedness property of the 
associated polyhedron may fail to hold. We also study in detail the LPP over the polyhe- 
dron defined by such bisubmodular functions. We also study some more properties of such 
functions and their associated polyhedra. 

Definition 3.1.1. Let E he the finite set and D(E) be the collection of directed sets of E. 
let f be a bisubmodular function from D{E) TZ.. 

Let P{f) be the Bisubmodular Polyhedron associated with f . 
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We now characterise the non-emptiness of the bisubmodular polyhedron P{f), when 
f{(p, (p) < 0- A necessary and sufficient condition for P{f) to be non-empty when 
f{d>,<P) = 0a.sm [34] is that /(A, 5) 4-/(5. .4) >0V(A.5) E D{E). A natural modification 
of the above condition for the case /(0, ^) < 0 would be that it is satisfied for ail 
(A, B) ^ (/>, (p). This is what we state in theorem (3.1.1) The proof is the same as in [34]. 
We reproduce it here for completeness sake. 

Theorem 3.1.1. Let f be a bisubmodular function with f[p,(p) < 0, Its associated poly- 
hedron P{f) is non empty if and only if V{A,B) E D{E) and (A. 5) ^ {(p,(p)- 
fiA,B) + f(B,A)>0 

Proof Let P{f) ^ (p and let x E P{f) 

/(A, B) + f{B, A) > x{A, B) + x{B, A) = 0 (Since (A. B) ^ (cp, cp)). 

Conversely, Let /(A, B) -t- /(5, A) > 0 
Let us prove this by induction. 

For |5|=1. Then 
/(e, (p) -I- f{(p, e) > 0, 
that is, /(e, cp) > -f{(p, e) 

Choose x(e) E (-f((p,e),f(e,p)) 

Then x(e) E P{f)- Hence the result is true for |5)=1 Let us assume that it is true for 
|5|=k-l. 

Let us prove for |jE'|=k. 

Define a bisubmodular function fi on E' — E — s, for some s E E as follows: 

/i(A, 5) = f{A,B) y{A,B) E D{E'). Clearly /j is bisubmodualr on D{E') and |£''|=k-l. 
So by induction there exists x E P{fi)- 
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Let 

= (A.S}S(E) B)-nA,B).s€B} 

Clearly M <m. 

Choose x{s) € {M,m). 

X* = X E Pifi) and x(s) E (M, m). 

^x* E P{f). □ 

Through out this chapter we will assume that P{f) ^ cj), P{f) is bounded and an oracle is 
available which will return the value of f{A,B) for any set (A,B) € D(E). 

3.1.1 Linear programming problem (LPP) 

For a bisubmodular polyhedron P{f), linear programming problem is defined as, 

(P) max (3.1.1) 

\e^eE ) 

Subject to X € P(/) 

where c € PA is a weight vector corresponding to the elements of E. 

Its dual can be written as: 

(P) f{A,B)Y^A,B) (3.1.2) 

Subject to 

T] y\ 

iA B)eD(E) ^{A,B) ~ {A.B)eD{B) ^{A,B) ^ ^ 

c,GA e,es 


W)>0V(A,P)6P(P) 

If = 0, a generalised greedy algorithm for solving (3.1.1) is as follows. 

Order the elements of E according to 
P = {ii,Z2,...z„ : KJ > |CiJ > ... > |ct„|} 
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The vector x € 72." solves (3.1.1), where 

f z/ Ct > 0 

= < 

[ — if Cl < 0 

the indices i being ordered as in P and, 

(^ 0 , Bo) = (<f), (f) 

(4, Bi) = (A-i, .Sz-i U ej if Cl >0 

= (A-i U Cl, 5j_i) if Cl <0 

The vector x also represents an extreme point of the polyhedron P{f). 

In case /(^, (p) # 0, it is not necessary that the greedy algorithm will solve (3.1.1). 

In fact, the case f{(p,<p) 6 (0,oo) creates no problem for the greedy algorithm, but if 
f{(p, (f>) E (— oo, 0), then the greedy algorithm may not even obtain a feasible solution. 

In what follows, we investigate the reasons for which the generalised greedy algorithm does 
not work when /(^, (p) < 0. 

For the LPP (3.1.1) form the diset (X,Y) such that 
X = {ci : Cl > 0} and Y = {ci : Ci < 0} 

So the objective function in LPP (3.1.1) can be written as 

max ~ H 1 

yet6£' J exGK J 

Definition 3.1.2. Let f be a bisubmodular function andP{f) be its associated polyhedron. 
Let X € P{f)- A set (A,B) is said to be a x-tight set if x{AiB) = f{A,B). 

Fact: [46] Let / be a bisubmodular function and f{p, <p) = 0, then union and intersection 
of any two x-tight sets are x-tight. 

Lemma 3.1.1. Let f be a bisubmodular function with f{(p, <p) <0 and x 6 P{f)- If (A,B) 
and (C,D) are any two x-tight sets, then {A,B) U {C,D) and {A,B) n {C,D) need not be 
x-tight. 
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Proof. Since x 6 P{f) and (A,B), (C,D) are x-tight sets x{A, B) = f{A, B) and 
x{C,D) = f{C,D) and hence, 

xiAuC\BuD,BuD\AuC)+x{AnC.BnD) = x{A,B) +x{C,D) = f{A,B)+f{C. 

(3-1 

But, 

f{A, B) + f{C, D) > f{A U C\B UD,BU D\A UC) + f{AnC,Bn D) (3.1.4) 

If (A, B) n {C, D) = (p and f{(f>, <p) < 0, it is possible that 
fiA, B) + /(C, D) < f{A UC\BUD,B\J D\A u C) 

X{AUC\BUD,BUD\AUC) = xiA,B) + x(C,D) 

= f{A,B)+f(C,D) 

< jiAuC\BuD.BUD\AuC) 

and xiiA,B)niC,D))> 

Thus (A, B) U (C, D) and {A, B) fl {C, D) need not be x-tight. However when {A, B) D 
(C. D) ^ (p then by equations (3.1.3) and (3.1.4) we have union and intersection of x-tight 
sets are also x-tight. □ 

Definition 3 . 1 . 3 . A (0, ±1) matrix B is said to be a signed nested matrix if it can be 
transformed into a nested (0,1) matrix by a ±1 scaling of its columns. 

Definition 3 . 1 . 4 . Linear program (3.1.1 ) is said to have signed nested property if for any 
c in such that (3.1.2) has an optimal solution, there exists an optimal solution y* to 
(3.1.2) such that the row submatrix of the coefficient matrix of (3.1 2) corresponding to the 
positive components of y* is signed nested. 

Theorem 3 . 1 . 2 . (Kabadi,R.chandrasekaren) 

If f is a bisubmodular function with f{6,(p) — 0 then the following statements are equiva- 
lent: 

(1)(3.1.1) has signed nested property. 
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(2) Generalised greedy procedure yields an optimal solution Vc e 

(3) The function f is bisubmodular. 

Lemma 3.1.2. Theorem (3.1.2) does not hold when f{(p, o) < 0. 

Proof As in the submodular case one can show that even when f(f>, (p) < 0, (1) (2) 

and (2) => (3) 

Let X denote an optimal solution for (3.1.1). However, as discussed earlier, union and 
intersection of x-tight sets need not be x-tight and hence the row submatrix of the coefEcient 
matrix of (3.1.2) corresponding to the positive components of x need not be signed nested 
and bisubmodularity of / need not guarantee that the greedy algorithm will yield an 
optimal solution to LPP (3.1.1). So (3) (2) and (3) 7^ (1) □ 

Consider the following example. 

Example 2. 

max{xi — X 2 ) 

Xi < 1 Xi — X 2 < 3 

—Xi < 1 — Xi + X 2 < 0 

X 2 < 1 Xi -i-X 2 < 4 
—X 2 < 1 —Xi — X 2 < 2 
f{i,,4,) = -2 

The generalised greedy algorithm produces the solution (3,2) which is not a feasible 
point. The primal optimal solution is (1,-1) and the corresponding optimal solution of the 
dual problem is (10010000). The row submatrix of the coefficient matrix corresponding 
to the positive components of the optimal solution is 


/lOO 0 0000 

^000 -1 0000 
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which is not a signed nested matrix. 

Another important consequence of the value of / at empty set is as follows: 

Consider the LPP given in (3.1.1) and the LPP 

max I ^ Ct,Xi — E (3.1.5) 

Ve.6X e,6r j 

Subject to x(A, B) < /(A, B) V(A, B) C (X, Y) 
where Cj € X if Cj > 0 and ej € P if Cj < 0. 

If /(^, (f)) = 0, from the generalised greedy algorithm it follows that 

max ~ IZetSK = max ~ 12e,eY 

Subject to subject to (3.1.6) 

xiA, B) < f(A, B) V(A, B) e D{E) x{A, B) < f{A, B) V(A, B) C (X, Y) 
However when f{<p, <p) < 0 

max (Ee,€X - Ee.er < max (Ee,€X - Ee,€K 

Subject to subject to 

ai(A, B) < f{A, B) V(A, B) € D{E) x{A, B) < f{A, B) V(A, B) C (X, Y) 

Since the optimal solution of the r.h.s problem need not be feasible for the l.h.s problem. 
The linear programme on the r.h.s of 3.1.6 can be formulated as an LPP over a submodular 
polyhedron as follows: For A C X U F define /'(A) = /(Ai, A 2 ), where Ai = A Pi X and 
A 2 = A n F. /'(A) is a submodular function over the subsets of X U F. 

Let c( = Cl Vci € X 

= — Cj Vci G F 

The LPP on r.h.s of 3.1.6 can be rewritten as 


max : y{A) < /'(A) VA C X U f| 
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The case /(^, (f>) > 0 can be handled in the same way as for the submodular case in the 
previous chapter. 

Solving problem (3.1.1) does not appear to be easy for the case f{o.(p) < 0. However we 
give an efficient algorithm for solving the LPP, given in (3.1.5), defined on an orthant. 

3.1.2 Algorithm 

To solve problem (3.1.5) when f{ 4 >,(!>) < 0 we need the following results and definitions. 
Lemma 3.1.3. Let f be any btsuhmodular function, defined on D{E). 

where (X,Y) is given diset and (X,Y) e D{E) 

IS equivalent to finding 

min{f{A)) where E= {X U H) and f is a submodular function defined as follows 
f{A) = f{Ai, A2), where (^1) = (.4 n X) and As = (A fl F) 

Proof. We will show that / is submodular, on subsets of X U Y 

> /(Ai U Bi\A2 U H2, A2 U H2\Ai U Hi) + /(Ai n Hi, A2 fl H2) 

= f (Ai U Hi, A2 U H2) + f (Ai n Hi, A2 n H2), Since Ai n H2 = 0 = A2 n Hi 
= /(AuH) + /(AnH) 

/ is submodular, and by definition of /, it follows that 


min /(A) = min /(Ai,A2) 

.4C£:=(Xuy) ^ ' {Ai,A 2 )C{X,Y) ^ 


Hence the proof. 


Note: We have already proved in chapter-2 that, to find a non-empty minimising set of a 
submodular function /, the oracle for minimising / may be used atmost \E\ times. 
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Definition 3.1.5. A bisubmodular function f is said to be a zero bisubmodulur function 

with respect to a given diset (X, Y), if 

f{e, (f) = 0 and f{^, e) = 0 V(e, or [cp, e) e (A", Y) 

Definition 3.1.6. A function g is said to bimodular if 

g{A,B)+g{C,D) =g(AuC\BuD,BuD\AuC) +g{AnC,BnD) 
yiA,B),iC,D)eDiE) 


Lemma 3.1.4. If g is a bimodular function and if P{g) ^ (p, then g{(p, <p) > 0. 

Proof Let (A, B) be any non empty diset. 
g bimodular g{A, B) + g{B, A) = g{0, (p) + g{(j), (jf) 

If g{f), f>) < 0, then g{A, B) + g{B, A) < 0 
Since P{g) ^ 4>, by theorem (3.1.1), 

^(.4, B) + g{B, A) > 0 V(A, B) (<p, (p) Thus > 0. □ 


Let us assume that / is a zero bisubmodular function with respect to a given diset (X,Y). 
If not, then convert the given bisubmodular function to the zero bisubmodular function by 
subtracting a bimodular function w which is defined as follows. 


V(e,?i) eiX.Y) 


w{e,(t>) - f{e,<p) 
uj{d,e) = -/(e, 0) 


w{(i>, 0) = 0 


V(0,e) G(A-.r) 


u;(0, e) = /(<?),e) 
w{e, 0) = -/(0,e) 


Clearly w is bimodular and f — w is a. zero bisubmodular function with respect to (X.Y). 
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Lemma, 3.1.5. Solving the LPP over P{f) 2 S equivalent to solving it over P(^f — w). 

Proof. From the definition of / - lo it follows that if a: 6 P{f - w) then x + w E P{f), 
thus if, {xi,X 2 ...Xn) is an optimal solution for an LPP over P(/ - w) then (x[,x' 2 ...x'J is 
an optimal solution for the same LPP over P{f), where 

ai, + wii. (t>) if («, 4)) G (-Y, Y) 
x,+w{(^,i) if {(j>,i) E {X,Y) 

□ 

From now on we will refer to the LPP (3.1.5) with / a zero bisubmodular function with 
respect to the given diset (X,Y). 

Let us first solve the case where cfs are ±1. 

Let X — {i : Ci = 1} and Y = : Cj = —1} 

Now the LPP is, 

Maximise j x^ — x^ 

\e.eA' ej^Y 

Subject to a;(X, jB) < f{A,B) where (A, P) C (-Y, F) (3.1.7) 

We extend the algorithm of the previous chapter for solving the LPP associated with sub- 
modular functions to the LPP associated with bisubmodular functions. Since minimising a 
bisubmodular function over a given diset is equivalent to that of minimising a submodular 
function by lemma (3.1.3) , we will assume that an oracle for minimising a submodular 
function is available to us. Modifying the diset (X,Y) at each iterations and maintaining 
the bisubmodular function as a zero bisubmodular function with respect to the modified 
diset (X,Y) at each iterations are the main steps involved in the algorithm. 

For a given zero bisubmodular function / with respect to (X,Y), define a submodular 
function /' as in lemma (3.1.3). 
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Intialy find a non-empty minimum set A' of E such that f'(A') < 0 and for sets of B^’s, 
C A such that = 0 and UB^ = A' ^ ^ f'[B^). If there exist no such set then 

max X] OTi - ^ Xj = 0 

i€X jeY 

and hence = 0 Vi is the optimal solution. 

If A' is the set with above property and (A,B) is the corresponding diset of A' , then set 

max /(^> B) 

A B 

Set E' = {(£■ - A'), (A')}. Treat the set A' as singleton in E' . Let A' = e' say. So 
E' = {E~ A!) U e'. 

Let w be a modular function such that (/' — uj) is a zero submodular function on E'. Set 
E = E' and f' = f' — w and repeat the above process until there exists no set .4' on E 
with the above mentioned property. We now give the concise version of our algorithm. 
Procedure (Mm-Neg(/,E)) 

(This procedure is there in chapter-2) 

The procedure Min-Neg(/,E) will find a non empty minimising set (A') for a submodular 
function / having the above mentioned property. If there is no such set then it will return 
an empty set. 

Note 3. If the procedure Min-Neg(f,E) returns { 0 ) at the first iteration itself then 

Max XZ ~ XT ~ ^ where xi = X2= .. = = 0. 

Vex j 

We now give the main algorithm for solving the LPP (3.1.7). /' refers to the submod- 
ular function obtained from / with respect to the orthant (X,Y). 
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Main Algorithm 
i=0 
Begin 
Step 1: 

(.4') = Call Min-Neg (/', E) 

If (A^) = (^) then go to step 4. 

Step 2: Let the corresponding diset of .4* be 

(Find the new set E' and the function /' as discussed earlier) 

Step 3; Go to step 1 with the new set E' and /'; 

Step 4: Stop 
end 

At the end of the algorithm we will have the following partitions of (X,Y). 

(a; Y) = (Ai, U (A 2 , J52)...(Afc, B,) u (A\ A, r\ ujLi B,) 

Now 

V(e, or {4>, e) G {X/ uf^i 4, Y/ ufLi B,) , = 0 

and the optimum value is 

Max ( E - E = E 
V*eA' jer J t=i 

\ alidity of the algorithm has already been proved in the previous chapter for the sub- 
modular case. 

We can now solve the LPP (3.1.5) for any c > 0. 

Using the above algorithm, replace / in (3.1.5) by / where, 

f{A, B) = max ( I. - I. I x{C, D)<f{C,D)V (C, D) C {A, B)\ 


The following lemma is obvious 
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Lemma 3.1.6. Solving the LPP (3.1.5) with the function f is equivalent to that of solving 
it with f 

One can apply the generalised greedy algorithm to solve (3.1.5) with the function / . 
For that we need to calculate the value of / only at n disets, which can be found by the 
above algorithm. 

Complexity 

The complexity of the algorithm is the same as for the submodular case dealt in Chapter-2. 
Thus our algorithm will solve LPP 3.1.5 in o{rPk), where k is the complexity for minimising 
a submodular function. 

From equation (3.1.6), one can see that if x' is an optimal solution to the LPP 3.1.5 and 
if x' € P(/), then x' is also an optimal solution to (3.1.1). If x' 0 P[f) then, solving LPP 
(3.1.1) is not easy. However checking that x' € P[f) will require solving a membership 
problem for P{f). Wc show in the next chapter that the membership problem is equivalent 
to minimising a bisubmodular function. If x' ^ P(/), since the solution vector x of LPP 
(3.1.1) is not necessarily an extreme point of the polyhedron associated with LPP(3.1.5) 
it is not clear how the knowledge of x' can be used to obtain x. 

3.2 Examples 

In this section we discuss some problems where bisubmodular functions with f{(j), ^) < 0 
occur. 

Extension of bisubmodular functions 

In this we will show that bisubmodular functions with negative values on empty sets occur 
in the extension of bisubmodular function defined on intersecting families IF to all of D{E). 
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Let T he a. intersecting family with (0, (p) 

Let (.4i, B\), (.42, Bk) be the maximal members of the intersecting family T. 

Now for V(A'', Y) C D[E) , (.Y, Y) ^ (^, 4>), T contains a unique smallest subset including 
(X.Y). Let it be denoted by {X,Y). 

Define 

flXVl = i f{(X,Y)n{A„B,)) if (.Y,y)n(A,s.)#(^,« 

’ I -c if (.Y,y)n(,4..B.) = W.«>) 

Now the extension is defined as follows 



where c is a large positive number. 

Now’' /(^, (f))=-kc +kc -c= -c. 

Here this bisubmodular function has negative value on the empty set. 


Partition Problem 

Partitions of directed sets 

Definition 3.2.1. We say that {(Ai, Bi), {A 2 , B2)...{Ak, Bk)) t-s a partition of (A,B) if 
(.4,. B^) e D{E)\/i , (.4, n 5,) = and (.4,, H,) = (.4. B) 

Let Tt denotes the collection of all partitions of (A,B) 

Let us define 

k 

fiA,B) = mmY^f{At,B^) 

l=l 

Here the minimum is taken over all the partitions of (A,B) 

Now consider the dual of the LPP (3.1.7) 

(D) Minimum J2y{A,B)f{A,B) 

subject to .|^.j/(A,5) = 1 

^y{A, B) = —1, y(^) .S) > 0 V (.4, B) C (A, Y) 
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Here any partition of (X, Y) is a feasible solution to the dual problem. 

Note that the set of all minimum partitions of (X.Y) forms a lattice. 

The following lemma is obvious. 

Lemnaa 3.2.1. If f{(p,(f)) = 0 then, 

Minimum ^ y{A, B)f{A, B) = f{X, Y) 

Consider the above dual problem with / replaced by f where /' = /—A and A € (0, oo), 
then f is a bisubmodular function having negative value at the empty set 
Note: P{f') can be empty by theorem-(3.1.1), but one can always find a partition of a 
given diset (X,Y) with respect to /' by using the algorithm given in the previous section. 

3.3 Properties of BSM functions with /(^, </>) < 0 

Total dual Integrality 

Let / be a bisubmodular function. Define a function f as follows: 

f'{A, B) = max{x{A, B) : x £ P{f)} (3.3.1) 

When f{<i>, <f>) = 0, /'{A, B) = f{A, B) 

If f{4>, (jf) < 0, f'{A, B) < f{A, B), but the following claim can be made. 

P(/) = F(/') and /' is a bisubmodular function. Again since P{f) is bounded /' is always 
finite. 

Theorem 3.3.1. Let f be a bisubmodular function with f{4>,((>) < 0, then the system of 
inequalities {maxcx : x G P{f)} is TDI. 

Proof. By the above claim, max{cx : x G P{f)} = max{cx : x G Pif')} 

Since f is bisubmodular and f'{<f), 0) = 0 it follows from the greedy algorithm that r.h.s 
has the TDI property. 

Since P{f) = P{f), it follows that the left hand side also has the TDI property. □ 
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Base polyhedron 

Let (/, D{E)) be any given bisubmodular system and (X.Y) be any orthant m D{E). Define 
the polyhedron P{x,Y){f) as follows: 

P{K,Y){f) = {x|x G 72.^V(,4,B) C {X,Y) : x{A,B) < f{A,B)} (3.3.2) 

Now we will define the base polyhedron of the orthant (X.Y) as, 

= {x:xe P(x.y)(/)<x:{X, Y) = /(A', y)} (3.3.3) 

One can see that this base polyhedron B(x,y) if) the same combinatorial structure as 
the base polyhedron of a submodular system. Hence when /(<?, p) < 0, the base polyhedron 
Bixy)if) can be empty even when P(xx)if) is non empty. 

When f{<p,(i>) > 0, to check B(x,Y)if) is empty or not it is enough to check P(x,Y)(f) is 
empty or not. However it is not the case when /((^, (j)) < 0. 

Lemma 3.3.1. Let f be any bisubmodular function with /(©. d) < 0, then B(^x,Y)if) 
non empty if and only if P(x,Y)if) *s nonempty and there exist no partition (j4j,5j)f_i of 
(X, Y) such that 

ZfiA.B,) < f{X,Y} 

Proof. If P(x,Y]if) is non-empty and if there exist a partition (.4i,5i)f_i of (X,Y) such 
that 

i^f(A,B,)<f{X,Y) 

1=1 

then for every x G P{x,y) if) 

x{X.Y) = < y /(A.s.) < f(X:Y) 

2=1 

x{X, F) < f{X, Y) and hence the base polyhedron is empty. □ 
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Connectedness 

Let (/, D{E)) be any bisubmodular system, with /(o, 0 ) = 0 For any given orthant (X.Y) 
define F(A',r)(/) and B(^xy)U) as in equations (3.3.2) and (3.3 3). 

Let /(A',y) be the submodular function corresponding to the polyhedron P(x,Y){f)- 
That is for ACE f(x,Y){A) = f{Ai, A2), whereAi = AnX,A 2 = AnY 
VVe have already shown that P(A'-,r)(/) = P{f(x,Y)) and B^x,Y){f) = B{f(^x,Y)) 

Definition 3.3.1. Let (/, D{E)) be any bisubmodular system with f{4>, (j)) = Q and P{f) ^ 
0- {fiB{E)) IS said to be connected if for any orthant (X.Y), the submodular system 
{f(x,Y},D) is connected, 

that is there exists no non empty proper subset A of E such that 

/(A-,r)(F-A) + /(A-,y)(^)=/(A,y)(F) 

In other words 

for any given orthant (X, Y) there exists no non empty proper subset A of E such that 
f{X n A,Yn A) + f(X/A, Y/A) = f(X, Y). 

Let us now extend the concept of connectedness to a bisubmodular system {f,D{E)) 
with f{4>,(j>) < 0. When /(<6, 0) < 0, then from lemma (3.3.1) it follows that B(^x.Y]{f) 
can be empty and hence it is not possible to define the connectedness for the submodular 
system (/(a'.k), -D)- To avoid this we define a function /' as in equation (3.3.1). 

Clearly /'(<■/>, 0) = 0 and yj is a submodular function with respect to the orthant (X.Y). 

Definition 3.3.2. Let {f,D{E)) be a bisubmodular system with f{(j), cj)) < 0. We will say 
that {f,D{E)) IS connected if for any orthant (X.Y) the submodular system (/(A',y))F)? 
where D is the collection of subsets of E is connected, 

that IS for any given orthant (X, Y) there exists no non empty proper subset A of E such 

thatf{XnA,YnA) + r(X/A,Y/A)=f(X.Y). :£iVTSAL LiBRAR* 

I I. T.. KANPUR 


U5683 
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In the following lemma we characterise connectedness of [f,D{E)) when /(0, 0) < 0, 
in terms of the function f. Our proof of the lemma differs from that of [ 4 ], 

Lemma 3.3.2. Let {f,D{E)) be any connected system, with < 0 then there is no 

{A.B) € 3^ such that f {A, A) AAB^^E^d and{A,B) , (S,A) G D{E). 

Proof. Since (/, D{E)) is connected, from our definition it follows that 

for any orthant (X.Y) the submodular system {f[xyyD) is connected. By theorem ( 2 . 3 . 3 ) 

of chapter -2 there exists atleast one base x 6 B(^x,Y){f') such that 

^(-- 1 ) < f[x,YM) cage 

=> x{Ai,A 2) < f'{Ai,A2), V {4>,4>) C (^4i.42) C (A',!') 

Let (A,B) be some diset such that AU B ^ E,d> and f'{A. B) + /'(B, A) = 0 . 

We have atleast one orthant (X.Y) such that (.4, B) C (ALT). 

Let flx.Y) submodular function corresponding to (X.Y) There exists atleast one 

X G P{f') such that x G B(x,y)(/0 and 

x{A.B)<f'iA,B) 

.r(.4, B) + x{B, .4) < /'(A, B) + f'{B, A) = 0 . 

(since (B,.4) G D{E) and x{B,A) < /'(B, ,4).) 

x(.4, B) 4- x{B. .4) < 0, which is a contradiction. 

So there is no (A,B) with (A U B) 7 ^ B, 0 such that /'(A, B) + f'{B, A) = 0 

Hence the lemma. □ 

Definition 3 . 3 . 3 . A bisubmodular system {f, D{E)) vjithf{o,^) < 0 andP{f) ^ f is said 
to be fully connected if there exists no {(f), <f)) (A, B) G 3^ such that /'(A, B)-f /'(B, .4) = 

0, where f'{A, B) = max{x{A) — x{B) | x G B(/)). 

That is we say that (/, D{E)) is fully connected if (/', D{E)) is fully connected. 

Note 4 . Combining the criterion for non emptiness of P{f) in theorem (3.1.1) and the 
definition for fully connectedness of{f',D{E)) it follows that {f',D{E)) is fully connected 
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zf f(A,B)+f'{B,A) > 0 ^ {A,B) e D{E). 

Lemma 3.3.3. (f,D(E)) is fully connected bisubmodular system with f{(j),(j)) < 0 if and 
only if P{f) IS full dimensional. 

Proof. First we will show that, if {f,D{E)) is fully connected, then P(/') is full dimen- 
sional. 

Since {f',D{E)) is fully connected, 

f'{A, B) + f'{B, A) > 0 V(0, (p) (.4, B) € D{fE) by the above Note. 

From the proof for theorem (3.1.1), for non emptiness of P{f'), it follows that the interval 
[rn, M] contains an interior point for all k. Hence p(f') has an interior point. 

So if (/, D(E)) is fully connected then P{f') is full-dimensional 

Since fiA^B) < fiA.B) V(.4,B) € D(E) {A.B) ^ it follows that P{f) is also 

full-dimensional. 

Conversely if P{f) is full dimensional we show that the bisubmodular system (f,D{E)) is 
fully connected. Since P{f) = P{f'), full dimensionality of P{f) implies full dimensional- 
ity of Pif). By our definition f'[(p,<p) = 0 and by Fujishige’s result in [5], it follows that 
{f\D{E)) is fully connected. D 

111 the above section we have made an attempt to extend the concept of connectedness 
to the bisubmodular functions having /((?i, (p) < 0. Other results regarding connectedness 
such us defining connectedness for the restriction of / to some set A and obtaining a 
bidirected graph regarding connectedness can also be extended in a similar manner. 

Bisubmodular function Minimisation 

Bisubmodular function minimisation does not depend on the value of / on empty set. In 
case f{(p.,<p) < 0, one can always convert / into another bisubmodular function f’ where 
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f'{A, B) = /(-4, B) - f{o. <?), and 


min 

(A,B)eDiE) 


f{A,B) 


min 

(A,B)eD{E) 


f'U,B). 


In chapter-4 we discuss the mm-max theorem for a bisubmodular systems {f,D{E)) and 
show that this theorem is not valid when /(<^, < 0. 



Chapser 4 

Bisubmodular Systems 


4.1 Introduction 

A bisubmodular polyhedron is defined in terms of so-called bisubmodular function on a 
family of ordered pairs of disjoint subsets of a finite set Bisubmodular systems were 
examined by Fujishige [4], L. QI [46] and guha et.el [33]. Rank functions of ditroids, 
perfect ditroids and oriented inatroids are increasing bisubmodular functions. Again a 
g-polymatroid defined with respect to two increasing submodular functions is also a bisub- 
modular polyhedron associated with a increasing bisubmodular function. 

In this chapter we make an attempt to study the structure of increasing bisubmodular 
functions and their associated polyhedra. We then extend these concepts to the general 
bisubmodular function and its associated polyhedron. We also prove the min-max theo- 
rem for the !)isubmodular polyhedron in a way different from that of Cunningham [16] and 
investigate the relationship between the separation problem and bisubmodular function 
minimization. 

At the end of the chapter we minimize a bisubmodular function with respect to some 
simple constraints that arises in some practical problems. 
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4.2 Increasing Bisubmodular functions 

In [3 ( ] CIi3.nciras6karen and Ivaba.di introduc6d th6 coiic6pt of pseudomatroids. Xh6v gave 
the polyhedral characterization of pseudomatroids as. 
x{A. B) < b{A, B) V(,4, B) € D{E) 

where b : D{E) is an integer valued, bisubmodular function on D(E), satisfying the 

following conditions: 

1) b{(p. 0) = 0 

2) 6(ei, (j!i) = 0 or 1 Ve, € E 

3) {.4 CC.DCB: (.4, B), (C, D) € DiE) => biA, B) < b{C, D). 

Note that hero b is not an increasing bisubmodular function on D(E). In this section we 
study increasing bisul)inodular function and the polyhedral characterization is given by. 
x(.4..S) < /(.4,B) V(.4,B) e D{E) 

where / is a bisubmodular function satisfying the following conditions; 

(1) /(<6.d) = 0 

(2) / is increasing. That is, 

{V(.4.B),(ai:?) e D{E) such that(.4,S) C (C.D). f{A,B) < fiC,D)} 

A study on increasing bisubmodular function will help us to characterize different kinds of 
bisubmodular polyhcdra such as Ditroid polyhedron, g-polymatroid corresponding to two 
increasing subrnodular function and the polyhedron corresponding to the rank function of 
the oriented matroids. 

Wh have shown that the polyhedron P{f) associated with an increasing bisubmodular 
function has many properties similar to that of a polvmatroid. We also show that tight 
sets corresponding to non-zero components of v € P{f) will be non-cancelling. Using this 
property we define the closure, support function and partial order for any v G P(/) in the 
same way as for a subrnodular function. 
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Definition 4.2.1. Two disets (A,B) and (C,D) are said to be cancelling sets if 
(.4 n D)u [B r\C) ^ (p 

Definition 4.2.2. An incremental function of a function f D{E) Tl is defined as 
follows: 

fji^) ~ fi^ A- j) — f{X) where X — {Xi,X 2 ) € D(E) and we write X + e m place of 
X U (e, <f)) or X U {4>, e) It will always be clear from the context, whether e is a forward 
element or a backward element. 

Below we give two lemmas from [34] which characterise increasing bisubmoduiar func- 
tions in terms of their incremental function. 

Lemma 4.2.1. Let f : D{E) 7Z. be a non- decreasing function, f is bisubmoduiar if and 

only if its incremental function is non-increasing, le if and only if, 

fjiX) = fix +j)- fiX) > fix +j + k) - fix + k) = f,iX + k), 

for all j, k € E/X j ^ k. 

Lemma 4.2.2. / is an increasing bisubmoduiar function if and only if for any two non- 
cancelling disets (A,B).(C,D) € DiE), 

nA.B)<nC,D) + -Z,(f(CUj.D)-nC,D)) + Y:{f{C,Duj)~f{C.D)) 

We now show that tight sets corresponding to non-zero components of u G P[f) are 
non-cancelling. 

Theorem 4.2.1. Let f be a increasing bisubmoduiar function and P(/) be the polyhedron 
associated with it. For v € Pif), tight sets corresponding to non-zero components of v are 
non- cancelling. 

Proof Let Ug > 0 and (A4-e,B) and (C,D-be) be two v-tight sets That is, 


viA -\-e,B) = /(.4 -i- e, B) 
viC,D + e) = fiC,D + e) 


(4.2.1) 

(4.2.2) 
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Now from equation (4.2.1) 

Ve + v{A, B) = f{A + e,B)> f{A.B) (since / is increasing) 

=>Ve + v(A,B)>fiA,B) 

Ve> fiA, B) — v{A, 5) > 0 since v{A. B) < f{A, B) 

So Ve > 0 

Similarly from equation(4.2.2) we get 
—Ve + v(C, D) = /(C, D + e) > f{C, D) 

Ue < 0 

SO Ve = 0. which is a contradiction. 

Hence the proof. □ 

VVe can now' define the support and closure functions for v € P(/) 

Definition 4.2.3. Support function for v 6 P{f) is defined as follows 

I (e, <P) if Ve> 0 
iiup{v) = < U 

[ e) if Ve<0 

Since either (e, 4>) or (p, e) G sup{v) for any v G P{f), sup{v) ^ (j), for u 7^ 0. 

Definition 4.2.4. Closure function for v G P{f) is defined as follows 
c/(u) = u((.4, P) I (.4, J5) IS V — tight set) 

Note: Since tight sets corresponding to non-zero component of v are non-cancelling one 
can always define the closure function as above. 

Closure of v is always tight but support need not be. 

Definition 4.2.5. We can now define the partial order on the elements of cl(v). 

For (e, p) or e) G cl{u) 

(A, B)l^^^ = (n(.4. B) I (e, (/>) G (-4, B) where {A, B) is v - tight) 
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•®)(i?>,e) (n(.4. 5) j e) G {A, B) where [A,B) is v — tight) 

Since f IS increasing for any Ci, 62 G cl(v) only one of the following conditions is true: 

(1) (ei, 0) e (.4, (2) (ei, (p) e (.4, 

(3) (<^. ei) G (.4, (4) {f>, eO e {A, 

We can define the partial order < on the elements of cl{v) as follows. 

We say ei < 62 , if any one of the above four conditions is satisfied. 

Thus ei < 62 implies that there is a diset C C cl(v) such that ei, 62 G C, and the 
foru'ardness and backwardness of ei and eo are determined by the above conditions. 
Suppose ;; G P{f) is an extreme point of P(/), as already proved in [27], a subset of 
y-tight constraints will form a lattice in D{E) and the largest set of this lattice is sup(v) 
and hence will be a tight set. Since /(o. o) = 0 . by definition cl{v) is also a tight set and 
sup[v) C cl{ii). 

We now show that given an orthant of E and a partial order on the elements of E, we can 
generate an extreme point of P{f) 

Let (A,B) be the given orthant and let the partial order be ( 61 , 62 , 

Define (.4°, B^) = {4>, (p) and 

(Ah B^) = (.4*-^ U e„ B^-^) if 6 , G A 

= ife, gP 

(Here oiu' has to add elements to the set (.4*, P') in the given partial order.) 

!’ G P{f) is an extreme point, where component Ue, is given by the following relation. 

Ue. = f{A\B^)-f{A^-\B^-^) ife^eA 

= f(A^-\B'-^)-fiA\B^) ife^eB 
Theorem 4 . 2 . 2 . v generated by the above process is an extreme point of P{f) 

Proof Let us first show that v is a feasible point and then we can show that it is an 
extreme point. 
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To prove that v{C, D) < /(C, D) for every (C, D) e D{E) 

Nonce that if e, 6 .4 then r-e. = J{A\ B') - > 0 (since / is increasing) 

and if gj e B then Ug, = /(.4*~\B*~^) — f{A\B’-) < 0 
Case(l); (C,D) C (.4,B) 

In this case it is obvious that v{C, D) < f{C, D). 

Case(2):(C',B)g(.4,B) 

Let F = (Ci/et G C and e, G B) and 
F" = (ct/ej G D and gt G A) 

v{C,D) = v{C)-v{D) 

< v{C/F') - u(B/B") (Since C/F' C A and D/F' C B) 

= v{C/F\D/F") 

< f{C/F\D/F'') (follows from Case-1) 

< f{C,D) since {C/F\D/F") C {C,D) and / is increasing. 

=> ;.'(C.B) < f{C,D) 

So we have v{C, D) < /(C, B)V(C', B) G D{E) 

=> r is a fcuisiblo point. 

Again since j A U B |=n one can easily see that there will be n tight sets corresponding to 
v This proves that v is an extreme point of P{f) 

If (A, B) is not an orthant of E then by putting tv, = 0 Vg, 0 A U B we will obtain a 
V G P{f) by the above process, which may or may not be an extreme point of B(/). □ 

In the next section, we show that the algorithm in [9] for separation problem of a polyma- 
troid can be extended to the separation problem of an increasing bisubmodular polyhedron. 
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Example 3. 

<1 -Xi <1 

2^2 < 1 -X2 < 1 

X 1 -X 2 < 1 -Xi + X2 < 1 

X1+X2 < 2 — Xi — X2 < 2 

The extreme points of the above polyhedron are {(0, 1)(1, 0)(1, 0)(-l, -1)(0, -1).} 
Consider the extreme point(-l,0). Tight sets of this extreme points are 1), (2, 1)} 
ci(-l, 0) = (2. 1), (,4, = (<6. 1) and (^. B)}-;;"' = (2, 1) ^ 1 < 2 

So the partial order is that the element 1 should occur before 2. 

One can generate the extreme point corresponding to the diset (2,1) with the above partial 
order as follows. 

'«! = /(?>• (i>) - 1) = -1 V2 = /(2, 1) - /(o. 1) = 0 

So (-1,0) is the extreme point generated by the diset (2,1). 

Examples 

In what follows we give examples where the increasing bisubmodular functions and their 
iissooiated polyhedra occur. 

Ditroids 

Let D = {E,X) be a ditroid as defined in chapter-1. 

For details you may see [34]. 

Consider the rank function h of D, which is defined as. 
h{A, B) = max{\C U D\ : (C, D) C {A, B) and (C. D) € X] 

Clearly h is an increasing bisubmodular function and the corresponding polyhedron is the 
ditroid polyhedron for the ditroid D. 

Oriented Matroids 

Let M' = (JF, O) be an oriented matroid also defined in chapter-1. 
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For details you may see [10]. 

If = {E, O) IS an oriented matroid with O as the collection of its circuits, then the 
unsigned circuits of O, denoted b\ O , define a unicjue matroid associated with the oriented 
matroid {E,0). That is 

X = {/ : C g / for all C e O} 
is the independent sets of (£■, C?), and 

T' = {X': Xel}, 

are the independent sets of the matroid {E. O'). Xote that if C 6 O then -C £ O. If r' 
and r denote the rank function of {E, O) and [E. O') respectively, then 

r'(A'i, A'a) = U I2I ; (Ti,!;) C (A'l. AT), (li,I' 2 ) € X} = r(AT U A2). 

One c‘an see that the rank function r' of an oriented matroid is also an increasing bisubmod- 
ular function and the corresponding polyhedron an increasing bisubmodular polyhedron. 
g-polymatroids 

Let gji be any two increasing submodular functions such that, 
g{X) + /i(r) > (?(A7r) + h[Y/X) VAT Y CE 
Consider the g-polyrnatroid defined as, 

Q{g.h) = {x e < xiA) < 

Define /(,4, B) = i/(A) + h{B) VA, B C £ and ,4 n 5 = 0, 

then / is an increasing bisubmodular function and Q{g,h) = P{f)- 

4.3 General bisubmodular functions 

We now extend the above concepts to a general bisubmodular function. 

From now onwards / will denote a general bisubmodular function unless otherwise stated. 
Let / be a bisubmodular function and P{f) be its associated polyhedron. 
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V £ P{f) be any vector in the polyhedron. 

Let A = ((ALF)I {X,Y) is v — tight) 

As seen in the previous section if / is not increasing, then sets in A can be cancelling. 
Thus in order to define d(v), we need to talk of forward and backward elements in u-tight 
sets. 

cP(v) = {e £ E \ e is a forward element in the v-tight sets in A) 
d~{v) = {e £ E \ e is a backward element in the v-tight sets in A) 

Let d{v) = d'^{v) U d~{v) 

Now Vc £ c/ ■*■(«) define 

A*W=n((.Y,r)| (.\',K)eA) 

and Ve £ d~{v) define 
A:(«) = n((A', r) 1 r € >' (A, Y) £ A) 

A graph G{E, .4) is said to be bidirectcd graph, if an edge e £ A has two heads, (or) two 
tails (or) one head and one tail. In order to define bidirections, it is convenient to let each 
edge be incident to two half edges. Each half edge is incident to precisely one vertex and 
OIK' edge. L('t H b(> the s('t of half edges. 

A biorientation (or) Indirection, over G is a function a : H -i- {— 1,+1}. 

We will denote a bidirectcd graph by (G, .4. a). In [11] Bouchet et.el construct a jump 
system for a givt'n bidirected graph tis follows: 

For f £ Z-'\ and u G E, the excess of /' at E is 
exf'{v) = {23 : a is an edge incident to v} 

Given with Ci < C 2 , we denote by [cj.ci)] the set 

{/' G 2:-*^ : cr < /' < c^}- 

It has been shown in [11] that the set {E,ex{f') : f' £ [ci,C 2 ]} is a Jump system. For 
details see [11]. 
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We now define the concept of a degree sequence for a bidirected graph G{E, A, a). 

Definition 4.3.1. Let denotes the degree sequence of the bidirected graph G{E.A.a) 
then. 

= (number of half edges having heads at v)-(number of half edges having tails at v). 

In the above jump system put ci = 0 and cg = 1. for each edge e, then 
[ex[f'), f G [0, 1]} is the set of degree sequences of subgraphs of the bidirected graph 
G{E.A,a). Define 


ex{f'){A,B) = Y. exf{v) - Y exf{v) V{A,B) e D{E) 

i;€/l v&B 


and a function g <is 


giA.B) 


max 

xe{E. exif), f € [0. 1]) 


(a;(.4)-a:(5)), \/{A,B) e D{E). 


Then .9(.4.Z?) = rxf'{A,B). 

In what follows, giv('n a bisubmodular function / and P(/), then for any v G P(/), one 
can construct the bidirected graph 6^(2?. .4. cv). 

Caseh 

If {b. 6) G .4.,t (a) a) G ^4,7 (6) then(a. b) G .4, with tail at a and head at b. 



Case2: 

If (o. b) G A^ia) or{<p, n) G Ay{b) then(a,6) G .4with heads at both a and b. 


© © 
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CaseS; 

If {b. d>) e A; (a) or (a, (P) G Ay{b) then(a. b) e .4with two tails at both a and b 



Case4: 

If (d, b) e A^ (a) or {a, d) € A^ (6) then(6, a) e ^with tail at b and head at a. 
© © 


Case5: 

Vn G E/(i'^{v) construct a self loop at a with two heads at a. 
Vn G E/d~{u) construct a self loop at a with two tails at a. 



In what follows we talk of exchange capacities [28] of a pair of elements {a, b} G c/(n), with 
respect to v G P(/). 

Exchange capacity of the pair {a, 6} G d{v) is a positive scalar e such that an increase(decrease) 
of e in the co-ordinate of a and a decrcase(increase) of e in the co-ordinate of b in the vector 
V still keeps the vector in P(f). e is the largest such positive scalar. 

The following theorem from [28] gives exchange capacities in the five different cases, men- 
tioned above. 

Theorem 4.3.1. Let f be any bisubmodular function and P{f) be its associated polyhe- 
dron and V G P(/)- 
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1. In Casel w = {v -h e{a) - e{b)) eP(f) where 

t = m^r^{/(A',y)-^;(A■,}■■) | (X.Y) € D(B); ((a,^) £ (A, 7), 6 0 (AuK)) or (W.fi) 6 

(.Y,na^(-Vuy))}. 

2. In cases w = {v + e{a) + e{b)) e P{f) where 

E = mm{/(A',r)-o(A',y) | (A',y) e D{B); Ha.4>] e (A',y),6 5? (Auy)) or ({(>,0) e 
(xy),a^(A'uy))}. 

3. In caseS w = {v - e(a) - t{b)) e P{f) where 

f = min.{f{X.Y) - v{X,Y) j (.Y,r) g D{E). {{(j},a) G {X,Y),b ^ (JY U 
D) or ((/r0) G (A',r),a^(.Yur))}. 

4 .. In Casc4 ic — {t; — (-{a) +f(6)) EP(f) where 

e = min{J{X,Y)-v[X^ Y) \ (AM') G D{E\ {{cp.a) G {X,Y),b ^ (.YuF)) or ((6,0) e 
(A'.}'),a^(A'ur))}. 

5. In cascS 

(a) If a ^ <i'^{v) then 

w = r + t{a) where t = m.in{f{X. 1') - ^(A'. Y) | (A', Y) G D{Ey, (a, 0) G (AT, Y)} 

(b) I f a <4 r/”(r) then 

w = 0 - ^ (a) tohcre c = rmn{/(A'. }') - v{X. 1') | (A', Y) G D{Ey, (cf), a) G (X, Y)} 

If V is an extreme, point of P{f), the formulae for computing e get simplified for the five 
cases as follows: 

(i) e = f{A:f{a) - (6,0)) - u(aI,T(“) ~ 

(it) e = f{A^{a) - (0,6)) - ^(AtW “ 

(hi) e = /(A; (a) - (6, 0)) - t;(A' (“) “ ^)) 
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M e = f{A^{a) - {(j),b)) - u(^(a) - {(j),b)) 

(v) (a) e = /(c/+(u) U a) - /(c/+(y)) 

(bj € = f{cl~{v) U a) - f{cl~{v)) 

Sinc6 thG numbsr of co-ordinEtGS of w diffisr frorn that of v ar6 Githar oug or two in all 
the five cases from [33], it follows that w is also an extreme point of P{f). When / is 
an increasing function the sots X^(a) can be calculated in o{n^) time as in the case of 
polymatroids [9]. 

We now state and prove the Min-Max theorem for a bisubmodular systems and discuss how 
it can be used for computing the exchange capacities e defined in the previous theorem. 
This result has also been proved by Cunningham [1C] and Fujishige [28]. Our proof differs 
from both. 

Notation: For a given orthant (X,Y) and y € P{f) 

Vfx.y) < b Vf < b V(! e A" and > 0 Ve e F We use y < 0 instead of y^s) < 0. 
F(/(A',r)) is defined in chapter-3. 

4.4 Min-Max theorem for a bisubmodular system 

Theorem 4.4.1. Let f be a given bisubmodular function with (j)) = 0. Let x° € 7^^. 
If there exist a y S P{f) such that y < x°, then 

max {y{E) ~ x^{E) : tj < y G P[f)] = mm {/(.4. B) - x°(A, B) : (.4, B) G D{E)} 

(4.4.1) 

Moreover if f and x are integer valued then the maximum in the left-hand side of (4-4-1) 
is attained by an integral y. 

Here we prove the theorem by first showing that it holds for an orthant by using the 
fact that f{<p,(f) = 0. 
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Lemma 4.4.1. Let f be a given bisubmodular function with /{f, cf) = 0. 

(X. Y) be any given orthant and if there is a vector y e P(/(A-,y)) such that < 0, then 

max {t/C.Y, Y) : < 0; y € P(/(A',r))} = min {f{A, B) : {A, B) C (X Y)} 

(4.4.2) 

Moreover if f is integer valued then the maximum value attained by y on the left-hand side 
of equation (4-4-^) an integer. 

Proof. Define a submodiilar function fi oxi E = X U Y as follows: 

For any A' C E. let fx{A') = f{Ai,Bi) where .4i = .4' n .Y and Bi = A' n Y 
.4s shown earlier in (;haprer-3, 

min{f{.A. B)\ (.4, B) C (.Y, 1')) = mm(/i(.4') | V .4' C E) 

Since /j is a submodiilar function, by the niin-ina.x lelationship for a submodular system 
in [IS] 

mm{fx[A') I V .4' C E) = max{y{E) | y < 0,y € P(/i)) (4.4.3) 

If / is integxu' valued then /j is also integer valued, moie over fi is a submodular function 
and hence th<’ vi'ctor y that attains maximum at the right-hand side of the equation (4.4.3) 
will be an inregiu'. 

.4gain we have* 

r,mxUj(E) i ,/ <(),!/ 6 P(/,)) = max (y[X. Y) \ <a.ye P[hx.Y))) 

and henct' the result. □ 

Now we prove the theorem (4.4.1) 

Proof. We prove the theorem by making use of lemma (4.4.1). 

Let us first prove the simple case when x° = 0. Since x° = 0, by the assumption there is a 


y € P{f) such that y < 0. We want to prove that 

max {y{E) : y < 0, y € P{f}} = min {/(.4. B) : (A, B) G D{E)} 


(4.4.4) 
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By the above lemma (4.4.1) for any orthant (X,Y) vve have 

max {j/(A . } ) i < 0, y € P(/(A',y))} = min {f{A.B) | {A,B) C (X, F)} 

(4.4.5) 

Since there is a y € P{f) such that y < 0, equation (4.4.5) is valid for any orthant 
(X.Y) Take the min on both sides of the equation (4.4.5) over all orthants {X, Y) e D{E). 
The right hand side of equation (4.4.5) becomes. 


(Y F) ^ (A',F)}) =mzn{/(,4,B) : {A,B) € D{E)} 

Similarly the left hand side of equation (4.4.5) becomes, 


(4.4.6) 


(ma,r {//(A. F) | ^ , < 0, y € P{f(xx])}) = max {y{E) | y < 0, y 6 P(/)} 

^ ^ ^ (4.4.7) 

Since. 

max {y(E) | y < 0, y € P(/)} < max {y(A, F) | y, < 0,y e P{f{x,Y))} Vorthant (A,F). 

Combining (4.4.6) and (4.4.7) wc obtain equation (4.4.4). 

To prove the intt'grality of y, for any orthant (X,Y) if / is integer valued then the maximum 
on th(^ left hand .side of ('cjuation (4.4.5) is attained by an integral y, by lemma (4.4.1). 

Since \v(' are taking flu' ininirnurn over all orthants of D(E), the maximum on the left-hand 
side of equation (4.4.4) is akso attained by an integral y. 

Now \v(' prove the min-inax theorem for any a-" e RP. Consider the left hand side of 
equation (4.4.1). 

max {{y - x"){£’) : y < y e P{f)} 


Let us define a bLsubrnoduIar function g as follows: 

y(.4. S) = /(.4, B) - x%4, B). 


(4.4.8) 
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^ Vy € P(f), {y - x°) € P{g). 

Set y' = y - x° and equation (4.4.8) reduces to the following 

max{[y - x°){E) \ y <x°, ye P{f)} = max{{y'{E) | y' < 0, y' e P{g)} 

= rmn{g{A,B) ] (.4,S) e D{E)}. 

(From equation 4.4.4) 

Replace g by / and // = y — x°, we get 

max{yiE) -x°{E) [ y<x\ y € P(/)} = m%n{f{A,B) - x°{A,B) | [A,B) e D{E)} 
Hence the theorem □ 

Min-.Max theorem is not valid when /(^^. (i>) < 0. It has been shown in chapter-2 that, 
when /(<;;) < 0, tlum min-rnax tluHiroin in no longer valid for the submodular system. 

In equation (4.4.3), we use tlu; min-inax theorem for submodular systems. Since /i(0) = 
f{o,d) < 0, ('ciuation (4.1.3) may not be valid and hence the min-max relation may not 
hold for some orthant and lumce may not also hold in general. 

Computation of exchange capacities 

Using the min-max ndation we give riu'thods for computing the exchange capacities, defined 
in theorem (4.3.1). 

We giv(^ the method to calculate the value of e for Cas(!(l). The values of e in other cases 
can be computi'd in a similar way. 

Let E' - E - {«./;). Define functions /„ and fb on D{E') as follows 
fa{XA') = f{X U a, r) and MX, Y) = /(.Y , F U 6) V(.Y, F) € £>(^0 
Now define 

Va{X, Y) = n(.Y U a, i') and t;6(T, Y) = v(X ,YUd) 

Clearly /„ and /j are bisubmodular functions on D(E') and 
mzniMX, Y) - vJX, Y) I (X Y) 6 D{E')} 

= minifiX, Y) - y(X Y) | a € .Y and b i (X U F) , (AL F) € n(E)} = ej 
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Similarly mm{/6(-X’, 1 ) Vb{X^Y) | {X,Y) G D{E')} 

= m^n{/(X y) - v{X. Y) \ b eY and a ^ {X U Y). (X 7) G D{E)} = e, 
min[ei,^2) ~ ^ 

Since /a, h are bisubmodular functions fa-Va . /& - Vb are also bisubmodular functions. 

Let KiX,Y) = {fa -Va)iX,Y) and MX 7) = (A - n,)(X, 7) 

In order to apply the min-max relationship the value of / on empty set must be zero. So 
define 

(?a(X 7) = ha{X, 7) - ha{4>, <f) and gb{X, 7) = hb{X, 7) - hb{(j), 

mm{/a(X,7) -t^(X,7) | (X,7) G D{E')} = <^) +mzn{^a(X7) 1 (X,7) G D{E')} 

= ha{4>. 4>) + (y(F) 1 y < 0, ye P{ga)) =ei 

Similarly ea can also be calculated. 

From the above construction, computing exchange capacities for v G P{f) and v is not an 
extreme point, seems to be as difficult as that of minimising a bisubmodular function. 

4.4.1 Separation problem 

In this section the relationship between bisubmodular function minimisation and separation 
problem is given. 

Defmition 4.4.1. Separation problem is to determine, given a vector x and a polyhedron 
P{f) whether x e P[f) or not. If not, then to find a inequality of the system defining P[f) 

that ts most violated by x. 

As far as a submodular system is concerned it has been shown in (9j that the separa- 
tion problem for a given .x and P{f) can be solved by minimiring a snbmodnlar function g 
defined by g(A) = f{.i) -x{A). Conversely the general submodular function mimmization 
problem can be reduced to a separation problem. Now we will see this relationship in 
terms of a bisubmodular system. Let f be any given bisubmodular function and . the 
given vector. The separation problem for x and P(f) is equivalent to that of mimmimg 
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a bisubrpiodular function g, where g{A,B) = f{A.B) - x{A.B). 

If the set that minimises g is {6, (j)), then xe P{f) and if the minimum occurs at (C, D) 
then x(C, D) < /(C, D) will be the most violated mequalit}- of the system defining P{f). 
Let us see the converse part. 

The converse problem is to find a a: and a polyhedron P{j] such that minimising a bisub- 
modular function g is equivalent to that of solving a separation problem with x and P(/). 
The obvious choice here is ar = 0 and f — g By the Min-Max theorem of the previous 
section, one can easily see that minimising a bisubmodular function g is equivalent to the 
separation problem for a: = 0 and the polyhedron P(/), where f = g. 

However, is it possible to find a x and / such that / is also an increasing bisubmodular 
function? 

For a submodular system {g, D), one can always construct a x and / increasing as in [9]. 
Let Xj = g(E - j) - g{E) Vj e P, then 

f{A) = ^(.4) -f x{A) — g{(j)) is non decreasing submodular function. 

Solving the separation problem for this x and P(/) will also give the minimising set for g. 
The validity of the above construction depends on the facr that E is the unique maximal 
subset in D. 

But this is not the case for D(E). The number of maximal sets (orthants) in D(E) is 2'^. 
Thus given a bisubmodular function g it is not easy to find a j G 'RP, such that f = g-\rx, 
IS an increasing bisubmodular function. 

However for a given orthant (A'^, Y) and a bisubmodular function g one can find a x, such 

that f = g + X increasing with respect to (X,Y) as follows 

Let g be a given bisubmodular function and (X,Y) a given orthant. 

Let x*{j)= g{X - j, Y) - g{X, Y) if j G T 
= g{X,Y-j)-g{X,Y) ify-GY. 

Define /(A, B) = g{A, B) +x*{A, B) - g{(j), <i>), then / is increasing with respect to directed 


) 
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subsets of (X,Y). 

One can easily verify that minimizing the bisubmodular function g over the given orthant 
(X.Y) is equivalent to that of solving the separation problem with x* and P{f(XY)), where 
P{f{x,Y)) is defined in chapters. 

It IS possible to give a pseudo polynomial algorithm for the separation problem for an 
increasing bisubmodular polyhedron, by adapting the algorithm given in [9] for a polyma- 
troid. 

In the previous section we have shown that for an increasing bisubmodular function /, it 
is easy to obtain the sets -4^(e) for all v € P{f) We have also shown that obtaining 
adjacent extreme points of an extreme point in P{f) will require o(n^) effort. Given that 
d) = 0, implies x = 0 € P{f) and hence a; = 0 can be used as a starting point for the 
algorithm in [9]. 

Another pseudo polynomial algorithm for the separation problem for a polymatroid [51] 
requires that a polynomial time algorithm is available for solving a LPP over the given 
polyhedron and that adjacent extreme points can be obtained for any given extreme point 
of P{f), for it to converge. 

Since the increasing bisubmodular polyhedron satisfies both the above conditions, the algo- 
rithm in [51] can be easily adapted for solving the separation problem for this polyhedron. 
Integrality of / will ensure faster convergence. 

If there is an oracle for minimising a bisubmodular function then by using that oracle we 
can minimise bisubmodular function with respect to some simple constraints that occur in 
some practical problems. 
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4.4.2 Minimizing Bisubmodular functions subjected to some sim- 
ple constraints 


In chapters we have seen that for / a bisubmodular function defined on D{E), for a given 
diset {X,Y)eD{E) 


is equivalent to 


min 

(A,B)C{X,Y) 


fiAB) 


min{f{A)) where E = {X U Y) and /is a submodular function defined as follows 

ACE 

f{A) = f{Ai,A 2 ], where Ai = {A n A") and A 2 = (4 fi Y). 

In the lemma below we detail a scheme for finding a nonempty minimising diset for a 
bisubmodular function. We state the problem as: 


min f(A,B) 


(4.4.9) 


Lemma 4.4.2. Problem (4-. 4-9) can he solved by using an oracle for minimising a bisub- 
modular function not more than 2\E\ times. 


Proof For j G F^,set E^ = E - j. 

Define 

B) = f{A U j, B) V(.4, B) G Dm and 
/(^,,)(.4,5) =/(.4,BU/) V(A,B) G Dm 
Clearly the above two functions are bisubmodular on D{E^). 

Now using the oracle for minimising a bisubmodular function find 


min {ffjMA,B)) 

(/i,n)e£>(£') 

min (f(^,)(A,B)) 

Similarly for all other j E E find the minimising disets for the corresponding 
functions. Among all the minimising sets find the one which corresponds to 
value. The set we obtain is the minimum nonempty set that minimizes f. 


bisubmodular 
the minimum 
□ 
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Consider the following constrained bisubmodular function minimisation problem 
For a given i £ E, 


mm f{A, B) 


Lemma 4.4.3. The above problen can be solved by using the oracle for minimising a 
bisubmodular function not more than 2 times. 


Proof Let E^' = E\i 

min f(A,B) = f{Ai,Bi).s&y 

{A,B)CD(E') J \ u j 

Now define f(^^^){A, B) = f{A, B U i). Clearly is a bisubmodular function on D{E^). 

™'’^{A,B)QD{E' ) 

Let f(A,B)=min{f(Ai,Bi) , /(/I 2 .B 2 )} 

(A,B) is the required minimum set. □ 


Given a diset (A", Y) € D{E), such that |A'’UF| > 2. we now want to find a minimising set 
(A,B) that has atleast one element in common with (X,Y) and no more than (|AUF1 — 1) 
elements. The problem can be stated as follows: 


min 

{4>,<i>)C{A.B)eD{E) 


(f{A,B) : 1 <1 (A, B) n (.Y, Y)\<\XUY\ -1) 


Lemma 4.4.4. The above problem can be solved by using an oracle for minimising a bisub- 
modular function almost \E‘^\ times. 


Proof. Let (X,Y) be the given nonempty diset. 

Let {i,(j)) C {X,Y). Note that here i can also be a backward element of (X,Y). With out 
loss of generality we will assume that i is a forward element. 

Define /(i,^)(A, B) = f(A U i, B) V(A, B) € D{E\i) 

Since | A U F |> 2 ,let (y, (p) C (A, F) 

Here also j can be backward element of (X,Y). 



76 


Now find 


mm 

{4>,4>mA,B)^D{E\^) 




(j.#) i (A,B)) 


One can find the above minimum by using lemmas (4.4.3) and (4.4.4). 

Now repeat the above steps by fixing one element of (X,Y) and varying the others.Find 
the minimum among all the sets. So the complexity of the algorithm is o (IF'P). □ 

There are practical situations where we want to minimise a bisubmodular function with 
some simple constraints as above. 

In many problems, the diset {(p, 0) will be a trivial solution for minimising a bisubmodular 
functions. However we need only non trivial solutions. In such situations lemma (4.4.3) 
will be useful. If (X,Y) is a given diset with lA" U Y|=even, then one is required to find a 
minimising set (A,B) of / such that |(/1, H) fl (A', F)|=odd. Lemma (4.4.5) can be used 
to solve the above problem. These type of problems occur while solving the separation 
problem over P{f) defined by odd set constraints with respect to (X,Y). It has been proved 
in the previous section that the separation problem is equivalent to that of minimising a 
bisubmodular function. 



Chapter 5 


Minimum cost Network and 
Submodular flow problems 

5.1 Introduction 

The minimum cost network flow problem is one of the fundamentals of all network flow 
problems. The cycle-cancelling algorithm was one of the first algorithms developed to solve 
this problem. This algorithm maintains a feasible solution at every iteration. It proceeds 
by identifying negative cycles in the residual network and augments along these cycles until 
there are no negative cycles in the residual network. The cycle cancelling algorithm was 
suggested by Klein [39] and the generic version of this algorithm is known to run in pseudo- 
poh'nomial time. Goldberg and Tarjan [31] developed a strongly polynomial algorithm 
which cancels minimum-mean cycles. To obtain a better bound, they used the idea of 
indirect cost scaling with cycle-cancelling; cycles composed entirely of negative reduced 
cost arcs are cancelled and optimality violation of arcs are tightened by updating node 
potentials. The other cycle-cancelling algorithm by Barahona and Tardos [7] augments 
flow along all negative cycles with maximum improvement. Sokkalingam [52] developed 
an algorithm which augments flow along a negative cycle with sufficiently large residual 
capacity. 

Submodular flow problems are the generalization of classical network flow problems. They 
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are the network flow problem with boundary constraints described bv the submodular 
functions. A straightforward algorithm for solving submodular flow problem is the cycle 
cancelling algorithm. 

In this chapter we give a new cycle cancelling algorithm for the minimum cost network 
flow problem and for solving the submodular flow problem. 

Our cycle cancelling algorithm for minimum cost network flow problem works as follows: 
We propose a search procedure which looks for augmenting cycles as well as dynamically 
updates potential of the node blocking the search. The basic idea is to reduce optimality 
violation of all arcs incident to the node blocking the search by half. The important feature 
of our algorithm is that it simultaneously finds negative cycles and tightens optimality 
violations by using the basic graph search algorithm. 

We extend the algorithm to solve the minimum cost submodular flow problems. 

The organisation of this chapter is as follows: 

We first review some relevant definitions and results. Next we propose a new version of 
cycle cancelling algorithm. Following this we deals with the complexity analysis. We then 
study the computational experimentation of our algorithm. Here we compare our algorithm 
experimentally with the relaxation algorithm of [8] Computational results are also given. 
The c-code for our algorithm is given in the Appendix. At the end we propose a new cycle 
cancelling algorithm for the minimum cost submodular flow problem. 

5.2 Preliminaries 


In this section, we present network notations and review some well known results used in 
the subsequent sections. 
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Problem Statement 

Let G = {V, A) be a directed network defined by a set V of n nodes and a set ^ of m arcs. 
Each arc {ij) has an associated cost and an associated capacity u^J. We denote by C 
the largest magnitude of all arc costs. We associate with each node leV & number b{i) rep- 
resenting its supply/demand such that EteAr b (i) = 0. The minimum cost flow problem is 
to determine the flow x^J on each arc A which solves the following decision problem: 


Minimize 


( 5 . 2 . 1 ) 


subject to 




E 


{j : (iJ) € A} {j:ijA)eA} 


b{i) for all i eV, 


(5.2.2) 


0 <Xtj < Uij V {i,j) € A. (5.2.3) 

We refer to a flow x satisfying the mass balance constraints (5.2.2), and the flow bound 
constraints (5.2.3) as a feasible flow. Our algorithm maintains a feasible flow at every 
step and proceeds by improving its objective function value. Without any loss of generality, 
we assume that the minimum cost flow problem ((5.2.1) — (5.2.3)) is feasible and its optimal 
objective function value is finite. Verifying both the conditions will not require much effort. 
Existence of a feasible flow can be answered by solving a max-flow problem and finiteness of 
the objective function is guaranteed by the absence of uncapacitated negative cost cycles. 
This can also be verified in o(nm) time. 

Residual Network G (x) 

The residual network G(x) corresponding to a feasible flow x is defined as follows: For 
each arc (i,j) € A with Xjj < Utj replace (i,j) by an arc (t, j) with cost Cjj and residual 
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capacity = u^j — Xtj. For each arc (i.j) with Xij > 0 replace (i,j) by an arc {j, i) with 
cost Cjj = -Cjj and a residual capacity = x^J. The residual network consists only of arcs 
with positive residual capacity. 

For a given set tt of node potentials, the reduced cost of an arc (i,j) in the residual 
network is = Ci_, — 7r(z) + tt (j) . An arc (i,j) G G(x) is said to be eligible if < 0. 
Our algorithm often works with reduced costs in place of arc costs and makes use of the 
following well-known property: 

Property 1. For any directed cycle W in G(x) and any set of node potentials tt 

= E(ij)gwCij- 

Minimum Cost Optimality Conditions 

Optimal solutions of the minimum cost flow problem can be characterized in a variety 
of ways. The theorems below give these characterizations and some related results. For 
details see Ahuja et al.[3]. 

Theorem 5.2.1. Negative Cycle Optimality Conditions. A feasible solution x is an 
optimal solution of the minimum cost flow problem if and only if the residual network G (x) 
contains no negative cost directed cycles. 

Theorem 5.2.2. Reduced cost optimality conditions. A feasible solution x is an 
optimal solution of the minimum cost flow problem if and only if some potential tt satisfies 
the following reduced cost optimality conditions: 

Cy > 0 for every arc {i,j) in G(x) 

Theorem 5.2.3. e— optimality conditions; A flow x is said to be e optimal for some 
e> 0 if X together with some node potentials tt satisfies the following optimality conditions: 

Cy > — e for every arc (i,j) in G(x). 


We will also denote e as a measure of violation of optimality. 
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Theorem 5.2.4. For a mimmum cost flow problem with integer costs, any e-optimal fea- 
sible flow IS an optimal flow z/e < 1/n. 

Theorem 5.2.5. Suppose that x is an optimal flow with respect to the potentials tt and 
suppose that for some arc {k,l) G A, |cyj > 2n€. Then the flow on arc {k,l) remains the 
same for all e' -optimal flows for e' < e. 

5.3 A New Cycle cancelling Algorithm 

Reducing Optimality Violation by Dynamic Sharing 

In our algorithm, the main procedure searches for a negative cycle and when the search 
is blocked at a node, dynamically shares the optimality violation at the blocked node, 
if it is sufficiently large. Lid. eo denote the current maximum violation. The aim at each 
iteration is to rc'diu-c' the optimality violation of arcs incident to nodes visited by the search 
procedure to below fo/'i at its termination. We call this procedure as dynamic-share, 
which is described in details below. 

To find a negative' cycle, we start building a path by successively adding an eligible 
arc to the tip of the path. For that, we choose a node with backward eligible arcs, whose 
maximum violation is greater than eo/2. as the origin of a path. Initialy, the path contains 
no arcs and the tip of the origin is same as the the origin of the path. When we retreat 
from the origin, the path becomes null. 

We continue adding an eligible arc to the tip of the path until either of the following 
conditions results: 1) aiiding an arc forms a cycle, 2) there is no forward eligible arc at the 
tip. In the former case, we improve the solution by augmenting flow along the cycle, we 
start the procedure again at the first node common to both the path and the cycle. 

In the later case, the search is said to be blocked at the tip. We calculate the maximum 
violation, say a, of backward arcs incident on the tip. If a > e’o/2, we share, that is we 
increase the potential of the tip only by a/2 ; note that always a < eq. Hence, after the 
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update, optimality violation is shared equally between forward and backward arcs. We 
refer to this update as sharing update. Otherwise, we transfer, i.e., we increase the 
potential of the tip by a. After this update, the whole optimality violation of backward 
arcs are transfered to forward arcs. We refer to this update as transfer update. After 
potential updates, we retreats to the predecessor of the tip on the path. We terminate the 
procedure when we have retreated to the origin. 

In what follows is an example network and its corresponding residual network and an 
example of sharing update and transfer update. 



( 0 , 5 ) 


5 


( 4 . 4 ) 
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Now max {|c,j|} = €<, = 18 and eo/2 = 9 

At some stage of the iteration we have the path as gn-en below with and shown along 
the arcs. Now the path is blocked and the tip node is 3. 

TRANSFER UPDATE 



That IS no forward eligible arc from node 3. 

Here a = max violation of all backward arcs at node 3. Here the only backward arc at 3 
is the arc(l,3) which has optimality violation 4. 

So a = 4 < 9 and hence we have a transfer update Set 7r(3) = 7r(3) + a = 4 and the tip 
to the predecessor of (3). After the update the path is. 



Now one cannot move from node 1, as there is no forward eligible arc. Calculate the 
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maximum violation of all backward arcs at 1. Here (4,1) and (5,1) are two backward arcs 
at node 1, 

a = max(10, 18) = 18 > eo/2 = 9, 

and hence we have to update the potential of the tip node by sharing update. 

So 7r(l) = 7r(l) 4- Q!/2 = 9 

After this update you can either move from 1 to 2 or from 1 to 3. If you choose the path 

SHARING UPDATE 


0 0 9 4 



-4(5) 


CYCLE 


from 1 to 2 then there is a cycle and hence augmentation takes place. 


Algorithm: Dynamic Share 

The algorithmic description is given below. Let r refer to the origin. Path refer to the path 
built by the algorithm from the origin to the tip, and t to the tip of Path. 
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Procedure Dynamic sliare(r, eo) 
begin 


set T as the root of Path; 
while Path is not null do 


begin 

find an eligible forward arc of t, if it exists; 
if successful then 


begin 

add the arc to t; 
if cycle is formed then 
begin 

augment flow along the cycle formed; 

set the tip t to the first node common to both path and cycle; 


end; 


else set t to the head of the new arc; 


else 

begin 

compute the maximum violation of backward arcs; 
if it is greater than the critical limit then share; 
else transfer; 

set the tip to the predecessor of t; 

end; 

end; 


end; 
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We now prove some properties of the procedure dynamic_share. Let bmin(t) denote 
the minimum of reduced costs of backward arcs incident on t, and bviol(t) its absolute 
value. The following proposition is straight forward. 

Proposition 5.3.1. Increasing the potential of a node by 5 units (i) decreases reduced cost 
of each forward arc of the node by 5 units, and increases reduced cost of each backward arc 
by 6 units. 

Lemma 5.3.1. After a transfer-update at a node, (i) there will he no backward eligible 
arcs, (a) all forward arcs will have optimality violation < 

Proof, (i) Let us assume that the transfer update took place at a node t. The backward 
arcs at t are {ii,i 2 ...,ir) say. Let to denote the current maximum violation. 

Since it is a transfer update bviol{t) < eo/2 and —hviol{t) < 'i k — l...r 
After the update rr'^t) — ^{t) + bviol{t) 

The reduced cost of all backward arcs after the update are as follows: 

— ‘^ih) + 7r'(t) = Ct,^t - + 7r(t) H- bviol{t) > 0 V AT = 1 .. r. 

Which implies that all the backward arcs are ineligible 

(ii) Before the transfer update, there were no eligible forward arcs at t, that is the optimality 
violations are zero for all the forward arcs at t. We decrease the reduced cost of all the 
forward arcs by a quantity bviol[t) < to/ 2 and hence the optimality violation of all the 
forward arcs are < eo/2 □ 

Lemma 5.3.2. After a sharing-update at a node, each forward and backward arc will have 
optimality violation less than eo/2, where eo is the current optimality violation of the flow. 

Proof. Let us assume that the sharing update took place at a node t. So bviol{t) > eo/2. 
(Note that hviol{t) < Co) 

After the update 7r'(t) = 7r(t) + bviol{t)/2 
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By the above proposition (5.3.1) reduced cost of all forward arcs will get decreased by 
the amount hvzol(t)/2 and for backward arcs will get increased by bviol{t)/2. Before the 
sharing update, there were no forward eligible arcs, that is optimality violations of all the 
forward arcs at t were zero and hence the optimality violations of all the forward arcs at t 

<eo/2 

As far as the backward arcs at t are concerned, it is easy to see that increase in the reduced 
cost by the amount hviol{t)/2 makes their optimality violations < eo/2. □ 

Observ^e that no eligible arcs are created during flow augmentation, because after augment- 
ing flow along a cycle, only the reverse of the arcs on the cycle get added to the residual 
graph and by definition these arcs will have positive reduced cost and hence will not be 
eligible. Arcs in the cycle saturated by flow augmentation drop out of the residual network. 
Operations augmentation and update neither introduce new eligible arcs nor change the 
reduced costs. With these observations, we now prove the following theorem. 

Theorem 5.3.1. Let e = eo/2, where eo is the optimality violation of the flow, denote 
the tcU*get-approximation. Procedure dynamic-share has the following properties, (i) 
Between two consecutive sharing-updates, any node is (transfer) updated at most once. 
(ii)At any node, there will be at most one shanng-update. (hi) At the termination, arcs 
incident to nodes visited by the procedure have optimality violation less than or equal to the 
target-approximation. 

Proof, (i) Let a transfer update take place at tip t. By lemma 5.3.1 (i), there axe no 
backward eligible arcs at t, and since after the update the arc (predecessor(t),t) has become 
ineligible, we begin the search from predecessor(t). In order to reach node t, a sharing 
update must take place at one of the tails of an arc incident on t. Hence another transfer 
update at t can take place only after a sharing update has taken place. 

Thus we have proved statement (i). 
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(ii) A sharing update occurs at a node, say t, only when bviol(t) > e. By Lemma (5.3.2) 
, after the update, bviol(t) is < e. Also, by Lemma (5 3.1) and Lemma (5.3.2) together 
with the observation in the para preceding theorem(5.3.1), optimality violation of new 
eligible arcs is < e. Hence bviol(t) continues to be < e after sharing-update. Hence there 
will be no more sharing-updates. 

(iii) Optimality violation of all forward arcs of a node is < e after an update at that 

node. Optimality violation of forward arcs may only decrease due to augmentation and 
updates at other nodes. Optimality violation of backward arcs is < e after any update (it 
is zero after transfer updates) . Optimality violation of backward arcs may increase due to 
updates at other nodes. As we pointed out in the previous para, optimality violation of 
new eligible arcs is < e. Hence optimality violation of backward arcs of any node visited 
remains < e. Hence statement (iii) follows. □ 

5.3.1 Current Arc Implementation. 

We implement the algorithm using the forward-backward-star data structure. This 
enables us to update and calculate the reduced cost on any arc in 0(1) time as well scan 
forward star (backward star) of any node z inO {\A (z)]) (O (\B (?)()) time. Where A(i)=set 
of all forward arcs from node i and B(i)= set of all backward arcs from node i. 

Further, to reduce the time for finding eligible arcs at the tip of the path, we use the 
current-arc data structure on top of the forward-star data structure. For each node i, 
we define a current arc (i,j), as the arc we wish to examine for eligibility when node i is 
visited. With this data structure we can do following operations in O (1) time. 
set(i). =(i,j), where j is the first node for which (i.j) G forward star of i. 
get-arc(i). Return the current arc of node i. 

move-arc(i). If the current arc is not the last arc of forward star of node i, set the 
current arc to the next arc in the forward star; otherwise, return end of list. 
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To use the current-arc data structure in the procedure dynamic-share, we set the current 
arc of each node to its first forward arc; when a node becomes the tip by get-arc operation, 
we get the current arc of that node, and test whether the arc is eligible and has non-zero 
residual capacity . If the answer is yes then, current arc of the node is added to the path; 
otherwise, the operation move-arc(i) is used and residual forward arc is found. If no such 
arc can be found, end of the forward star will be reached and the node will be updated 
and the current arc will be reset to the first arc of the forward star of the tip node. The 
path will retreat to the predecessor of the node. 

We will verify the correctness of the implementation. Let i be any node. It is enough 
to prove that there is no eligible forward arc, at node i, when the current arc reaches the 
end of the forward star of node i. This can be seen as follows: 

The current arc moves to the next arc in the forward star of node i only when it becomes 
ineligible or its reduced capacity becomes zero, (ie, the arc drops out of the residual 
graph). Hence all the arcs of the forward star of node i scanned before the current arc 
remain ineligible and (or with zero residual capacity). So at the end, all the arcs in the 
forward star of node i will become ineligible. 

Successive Dynamic Sharing algorithm 

In this section, we describe the procedure which finds minimum cost flow by applying 
iteratively the procedure dynamic share described m the previous section. For the moment, 
we assume that costs are integral. 

The algorithm starts with a feasible flow and a set of node potentials tt. Initialy, we 
can start with tt = 0. Each major iteration consists of the following steps. We first find 
the maximum optimality violation over all arcs, say d If ^ < 1/n, we have an optimal 
flow and we terminate the algorithm. Otherwise we set the target approximation, say 
6, for procedure share to be applied in this iteration as 9/2. We select a node, say r, with 
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eligible backward arcs whose maximum optimality violation is greater than e. We call such 
a node as candidate node. We apply the procedure d}-namic_share setting node r as the 
root node. The next iteration starts when there are no candidate nodes. 

5.4 Complexity Analysis 

We now analyze the complexity of the algorithm. xA.ny arc (i,j) may become an eligible 
arc again during the procedure when the potential of its tail i increases. Note that flow 
augmentation does not create any eligible arcs. Since the potential of any node increases 
at most n times during the procedure, any arc can becomes eligible after being ineligible 
atmost n times . Since augmentation removes at least one arc from the residual graph by 
saturating it, this observation implies that there can be at most 0{nm) flow augmentations. 
Since each augmentation takes 0{n) time, the time taken by the flow augmentation is 
0{rv‘m). 

We now account for the time taken by arc scanning. Between two consecutive scanning 
each ineligible arc is scanned at most once. But, an eligible arc may be scanned when 
path retreats either (i) after augmentation or (ii) after sharing-update. We can include the 
time taken by arc scanning in (i) to the flow augmentation time, and the time due to (ii) 
to the number of sharing updates which is 0 (n). Since the number of updates at a node 
is 0 (n), the time for arc scanning is 0 (T,g,Y 1--1(*)|) = O (nm) . The time taken by the 
retreat operation is equal to the number of updates, which is O (n^). The time taken by 
operations add-arc can be charged to the operations flow augmentation and retreat. Thus 
overall complexity of the procedure is 0 {irm). We now state the above observations in 
the following lemmas. 

Lemma 5.4.1. Using the current-arc data structure, procedure dynamic-share can he 
implemented in O (n^m) time. 
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Lemma 5.4.2. Each major iteration of the algorithm successive dynamic sharing reduces 
the maximum optimality violation by a factor of 2 and runs in O (n^m) time using the 
current- arc data structure 

The maximum optimality violation e of any feasible flow is at most C at the beginning 
of the algorithm. Since each major iteration reduces optimality violation by a factor of 2, 
after 0(log(nC')) iterations optimality violation will reduce to 1/n. By Theorem (5.2.4), 
when costs are integral, flow obtained after O (log(nC)) iterations will be optimal. 

As with other cost scaling algorithms, this algorithm can be modified for real data 
costs using the following well-known technique. We divide major iterations into a group 
of log f«l + 1 iterations, and each iteration starts with solving a minimum mean cycle 
problem. Let A denote the minimum mean value. By adding A to each arc in the residual 
graph and solving a shortest path problem, we can update potentials such that e = A. In 
particular, each arc in the minimum mean cycle has its reduced cost equal to —A. After 
log [n] + 1 iterations in the group, e < A/2n; this implies at least one arc in the mim'innm 
mean cycle found at the start of this group of iteration will have reduced cost > 2ne. Fix 
the flow on that arc. Details can be found in Goldberg et. al.[31] and Ahuja et. al [3]. 
This implies we will get the optimal flow in m group of major iterations, each fixing flow 
on one arc. Thus, incorporating the minimum mean cycle routine as above, the algorithm 
solves the minimum cost flow problem in O (mlogn) major iterations. 

Theorem 5.4.1. Successive Dynamic Share Algorithm performs O (log (nC)) iterations 
for integral data and runs in 0 (n^mlog(nC)) time. By incorporating minimum mean 
cycle routine into the algorithm, for real data, the algorithm performs O {m log n) iterations 
and runs in O (n^m^ log n) time. 
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5.5 Computational Experimentation 

5.5.1 Introduction 

Historically, computational experimentation has been the primary method for comparative 
evaluation of network flow algorithms. During the 1960’s it was generally believed that 
primal-dual methods hold advantage over primal simplex methods. 

However in 1970’s use of sophisticated data structures propelled primal simplex algo- 
rithms to a position of prominence for solving general minimum cost flow problems. The 
different primal-dual algorithms in use are the out-of-kilter method, Relaxation algorithm 
and the successive shortest path algorithm. Almost all the primal methods are variations 
of the cycle-cancelling algorithm. 

Bertsekas and Tseng [8] compared their relaxation code(RELAX-II) with two state-of 
the-art codesrKILTER (a primal-dual code due to Aashtiani and Magnanti) [1] and RNET 
( a primal simplex code due to Grigoria and Hsu 1980). Among the available codes at that 
time it appears that RNET had produced the fastest computations for the NETGEN [40] 
bench mark problems. 

But the results of (Bertsekas) [8] show the substantial superiority of the RELAX-II code 
over the other codes. For the Assignment and Transportation problems computational 
time of RELAX-II code is almost flve times better than KILTER and RNET, and for 
uncapacitated and lightly capacitate problems it out performs the other codes. The margin 
of superiority increases for large problems. 

In this section we compare our algorithm, described in the previous section with the 
RELAX-II code. 

The test problems used are the standard bench marks generated by the publicly available 
NETGEN program [40]. We call our code as CYCLE-code. 
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5.5.2 Computational Implementation 

as the implementation is concerned, in each iteration during updating potential at 
tip nodes, division by a factor of 2 occurs thus making the value of e non integral. However 
by slight modification of the costs we can ensure that e remains integral. We do so by 
multiplying arc costs by number of nodes n and setting the initial value of e = 
stop the algorithm when e < 1. 

But while doing so, the values of costs become very large, thus making the computational 
efficiency less. So to avoid this we do the following- 

Start the algorithm with the given costs, each time while calculating e/2 take its integral 
part and stop the algorithm when e < 1. Now the flow is e-optimal but not optimal. To 
make it optimal we apply the shortest-path algorithm as follows. 

For each node calculate its demand (supply) as follows Let b(i) denote the supply(demand) 
for the node(i) Set b(i)=0 for all the nodes. For every admissible arc (i,j) do the following: 

begin 

Utj) and (Cj^ 

begin 

b(i)=6(z) - (u„ - x,j) and b(j)=b(j)+(uij - x,j) 

Xjj —V-ij 

end 

if((xy > 0) and = 1)) 
begin 

b(i)=b(i)+x,j and b{j) = h{j) - x,j 

x^J=0 

end 


end 
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After the above steps, nodes with b{i) > 0 are supply nodes and with b{i) < 0 are demand 
ones. Now select a supply node i and a demand node j and find the shortest path from 
node i to j, with reduced cost as the distance from node i to j. 

After finding the shortest path augment the flow along the path and also calculate the 
distance of the shortest path from i to j, for updating the node potentials. Change the 
supply and demand for nodes i and j respectively and proceed. These steps are explained 
below. 

Shortest-path(iJ) is a procedure to find the shortest-path from i to j. 

Repeat 

begin 

(Let i be the supply and j be the demand node) 

CALL shortest-path (i,j). 

Augment the flow along the path. 

Let delta be the amount of flow augmented. 

b(i)=b(i)-delta; 

b(j)=b(j)+delta; 

For all the nodes in the shortest path change the potentials as follows. 

TTj = TTj — +distance of the shortest path, 
end 

until( demand and supply for all the nodes are satisfied.) 

Now the flow is optimal. 

5.5.3 TEST CONDITIONS 

All problems were tested under identical conditions. We worked on HP-9000/800 main- 
frame series. We use the language C. The RELAX-II code available to us was in Fortran-iv. 
We used FORTRAN to C converter and convert the Relax-II code to C language. 

We obtain the CPU times using system command TIME in Unix. The time command will 
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give three times, real, user, system. The CPU time is user time plus system time. 

To obtain a starting feasible flow for our algorithm ive use a maximum flow algorithm. 
The computational investigations of maximum flow algorithm by R.K Ahuja et.el [2] con- 
clude that the networks generated by NETGEN are rather easy classes of networks for 
maximum flow algorithms. In most of the example problems we took in this section, we 
found that the time taken for solving the max flow is negligible. So in our study, the time 
taken for solving the maximum flow problem is not included. 

5.5.4 Computational Results 

We have organised our computational results for the Cycle-code into five tables. All the 
problems shown were generated using the widely used, publicly available NETGEN pro- 
gram. We used the same random seed number 13502460 { the number used by the authors 
of NETGEN). We also provide all additional information needed to replicate these prob- 
lems. 

Table-I (Standard NETGEN Bench Marks) 

This table shows the result for Cycle-code and Relax-II code for the problems described in 
detail by Klingman, Napier and Stutz [40], generated hv the NETGEN program. The re- 
sults shows that for the transportation problems and uiu apacitated and lightly capacitated 
problems, both Relax-II code and Cycle-code have vein' close performance with Relax-II 
code having a slight upper edge over our Cycle-code 

As it has already been reported in [8] Relax-II code performs much better than RNET 
and KILTER for these problems, we can say that Cycle-code has an edge over RNET and 
KILTER. 

For the assignment problems interestingly. Cycle-code has a slight edge over Relax-II 
code.But for the uncapacitated and lightly capacitated problems with large nodes clearly 
Relax-II code is substantially faster than the cycle-code 
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Table-2 (Assignment problems with large cost range) 

These results are in general agreement with those for the assignment problems of 
Table-I. Here the cost range is from 1-10000. For future reference, we also give the optimal 
values of the objective functions. 

Table-3 (Lightly capacitated and high density problems) 

The problems here have a large number of arcs compared to the number of nodes. One can 
see from the table that the increase in the number of arcs does not affect the computational 
behaviour of Cycle-code and its performance is again \-ery close to that of the Relax-II code. 
Table-4 (Problems with lesser flow values) 

In this table we study the nature of problems which have less flow values but are dense. 
From the data available from this table one can see that here too. the performance of the 
Cycle-code is comparable to that of the Relax-II code. 

Table-5 (Transportation problems with large cost range) 

The results in the table are in general agreement with those of the problems 1-10 in Table- 
1. Both the Cycle-code and the Relax-code are performing verj’- close. Even with large 
costs range from 1-10000 the performance of the Cycle-code, is not affected. 

5.5.5 Conclusion 

Based on the computational results in Table 1-5, we can conclude that our Cycle-code is 
comparable to Relax-II code for both standard bench mark problems and a broad range 
of randomly generated problems using NETGEN. As mentioned earlier, Berstsekas and 
Tseng [8] have shown that their Relax code is much faster than ail the classical methods. 
The advantage of our algorithm is in its implementation, which is very simple. The per- 
formance is also very impressive. The Relax-II method adapt non linear programming 
concepts to solve the linear network flow problems. 
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Table 5.1: Standard Benchmark problems of Klingman. Napier and Stutz, obtain using NETGEN 


Problem type 

No. 

No.of 

Nodes 

No.of 

Arcs 

CYCLE-code 

Reiax-code 

Transportation 

1 

200 

1300 

0.14 

0.13 


2 

200 

1500 

0.19 

0.16 


3 

200 

2000 

0.29 

0.20 


4 

200 

2200 

0.32 

0.21 


5 

200 

2900 

0.28 

0.26 


6 

300 

3150 

0.41 

0.30 


7 

300 

4500 

0.68 

0.40 


8 

300 

5155 

0.60 

0.45 


9 

300 

6075 

0.91 

0.51 


10 

300 

6300 

0.85 

0.60 

Total Time(l-lO) 




4.67 

3.22 

Assignment 

11 

400 

1500 

0 12 

0.17 


12 

400 

2250 

0.13 

0.20 


13 

400 

3000 

0.16 

0.23 


14 

400 

3750 

0.23 

0.30 


15 

400 

4500 

0.38 

0.38 

Total Time(ll-15) 




1.02 

1 28 

Uncapacitated 

16 

400 

1306 

0.26 

0.18 

and lightly 

17 

400 

2443 

0.31 

0 24 

Capacitated 

18 

400 

1306 

0.36 

0.14 

problems 

19 

400 

2443 

0.57 

0.23 

(Fewer nodes) 

20 

400 

1416 

0.34 

0.18 


21 

400 

2836 

0.39 

0.26 


22 

400 

1416 

0.33 

0.16 


23 

400 

2836 

0.43 

0.27 


24 

400 

1382 

0.25 

0.12 


25 

400 

2876 

0.46 

0.23 


26 

400 

1382 

0.19 

0.13 


27 

400 

2676 

0.36 

0.19 

Total Time(16-27) 




4.25 

2.33 

(Larger nodes) 

28 

1000 

2900 

1.00 

0.33 


29 

1000 

3400 

1.21 

0.35 


30 

1000 

4400 

1.72 

0.44 


31 

1000 

4800 

1.48 

0.46 


32 

1500 

4342 

1.76 

0.52 


33 

1500 

4385 

1.79 

0.58 


34 

1500 

5107 

1.80 

0.69 


35 

1500 

5730 

1.82 

0.72 

Total Time(28-35) 




12.58 

4.09 
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TABLE-2 This table contains Assignment Problems generated using netgen. For this 
problem the cost range is 1-10000. 

All problems are generated using the Random number= 13502460 


Table 5.2: Assignment Problem 


No. 

No. of 

sources 

No. of 
sinks 

No. of 

arcs 

total 

supply 

Relax-code 

Cycle-code 

Optimal I 
value 

1 

200 

200 

7000 

200 

0.61 

0.62 

140795 

2 

200 

200 

10000 

200 

0 75 

0.70 

112894 

3 

200 

200 

15000 

200 

1.36 

0.90 

83487 

4 

250 

250 

7000 

250 

0 67 

0.62 

206044 

5 

250 

250 

10000 

250 

1 10 

0 92 

160784 

6 

! 250 

250 

15000 

250 

1.95 

1.88 

106467 

7 

400 

400 

7000 

400 

0 84 

0.90 

487123 

8 

400 

400 

10000 

400 

1.00 

1.06 

350562 

9 

400 

400 

15000 

400 

1.53 

1.63 

256151 

10 

500 

500 

7000 

500 

0.74 

1.12 

724689 

Total time 





10 55 

10.35 



Table 5.3: High density problems 


No 

No of Nodes 

No. of Arcs 

Relax-code 

Cycle-code 

Optimal value 

1 

400 

20000 

1.71 

1.82 

386820595 

2. 

400 

25000 

2.73 

2.27 

793083593 

3. 

400 


2.72 

2.63 

311910054 

4. 

500 



2.23 

457795043 

5. 

500 

25000 

2.31 


358003117 

6 

500 


2.82 

3.21 

977418859 

7. 

600 


1.83 

2.85 

539940062 

8. 

600 

25000 

2.47 

2.72 

818453258 

9. 

600 


3.26 

3.61 

379366997 

10 

800 

20000 

1.81 


762503047 

11 

800 

25000 

3.22 

3.51 

1304820042 

12. 

800 


4.26 

4.42 

574490619 

Total time 



31.17 

35.27 
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TABLE-3: TMs table contains Capacitated problems with large number of arcs with 
fewer nodes. These problems are generated with XETGEN. 

All problems are generated using the Random number= 13502460 
Total-flow=400000 Cost-range=l-10000 MIN-CAP= 16000 MAX-CAP=30000 
NO OF SOURCES=8 NO OF SINKS =60 
Percentage of arcs capacitated Min=30%, Max=20‘/^ 

TABLE-4 This table contains Capacitated problems with large number of arcs with fewer 

number of nodes. Here we are having lesser flow values. These problems are generated 

with NETGEN using the Random number 13502460 

Total-flow=10000 cost-range=l-10000 MIN-CAP=1600 MAX-CAP=3000 

Percentage of arcs capacitated Min=30 %, Max=20 9c 

No of sources=8, No of Sinks =60 


Table 5.4: Problems with lesser flow value 


no 

NODES 

ARCS 

Relax-code 

Cycle-code 

Optimal-value 

1. 

400 

7000 

0.64 

OSl 

29355904.00 

2. 

400 

10000 

0.92 

1 17 

24245933.00 

3. 

400 

15000 

1.31 

1.72 

10586044.00 

4. 

500 

7000 

0.90 

1.00 

56949065.00 

5. 

500 

10000 

1.11 

1.24 

20078692.00 

6. 

500 

15000 

1.38 

1.87 

18841929.00 

1 . 

600 

7000 

0.94 

1.32 

47359349.00 

8. 

600 

10000 

1.00 

1.75 

47743815.00 

9. 

600 

15000 

1.41 

1 87 

22243930.00 

10. 

800 

7000 

1 0.60 

191 

60520678.00 

11. 

800 

10000 

1.20 

2 00 

54901837.00 

12. 

800 

15000 

1.54 

2.42 

26405504.00 

Total time 



12.95 

19.08 
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TABLE-5 This table contains transportation problems with larger number of arcs. 

Total-flow=20000 cost-range=l-100 

Random no for generating the problem is= 13502460 


Table 5.5: Transportation problems 


Problem 

no 

No. of Sources 

No. of Sinks 

No.of Arcs 

Relax-code 

Cycle-code 

Optimal-value 

1 

150 

150 

4000 

0.48 

0.61 

250311.00 

2 

200 

200 

4000 

0.50 

0.72 

312893.00 

3 

250 

250 

4000 

0.61 

1.10 

375418.00 

4 

150 

150 

7000 

0.52 

0.75 

179511.00 

5 

200 

200 

7000 

0.63 

0.81 

210261.00 

6 

250 

250 

7000 

0.77 

1.23 

253077.00 

7 

150 

150 

10000 

0.82 

1.11 

128522.00 

8 

200 

200 

10000 

0.91 

1.32 

166806.00 

9 

250 

250 

10000 

1.00 

1.56 

200204.00 

10 

150 

150 

15000 

1.12 

1.71 

119787.00 

11 

200 

200 

15000 

1.23 

2,31 

133184.00 

12 

250 

250 

15000 

1.32 

2.51 

135673.00 

Total time 




9.91 

15.74 



In what follows we give the graphical representation of all the tables given above. It will 
give as a clear picture about the performance of the Relax-code and the Cycle-code to the 
corresponding problems. 




time 
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Figure 5.1; Bench Mark problems 



Figure 5.2: Assignment Problems 
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5.6 Submodular flows 

Edmonds and Giles [18] introduced the submodular flo\v(SF) problem as a generalization 
of the network flow problem. In the SF, the netflows into the subsets are restricted by 
submodular set constraints. In order to formulate the Submodular flow problem we first 
give a few definitions. 

Definition 5.6.1. Let E denote a finite set, T a family of subsets of E. 
f : 7Z. is said to be submodular if for all X,Y E J-, 

f{X) + /(F) > f{X UY)+ fix n F) (5.6.1) 

Two subsets X,Y E are said to be intersecting if none of X- Y, Y-X, X nY is empty. 
X,Y E J~ are said to be crossing if X, Y are intersecting and X \JY ^ E. 
f : TZ IS said to be intersecting submodular function if f satisfies (5.6.1) for all X,Y 

intersecting. 

f : IF —¥ TZ is said to be crossing submodular function if f satisfies (5.6.1) for all X,Y 
crossing. 

Consider a directed graph G(V,E)j with node set V and arc set E, a submodular function 
b' defined on crossing family of subsets of V, lower and upper capacity vectors 1 and u, 
where I : E — oo) u : E [TZ, oo) I < u, and a cost function c : E —¥ IZ. 

For a vector x E 'Rp ACV, denote 

PxiA) = ; (i,j) € E, {i,j) enters A) 

^x(^) = ; {1,3) e E, (i,j) leaves A) 

A^(A) = PxiA) - 5x{A) and 

Q{l,u,b') = [xElZ^-. Ax(A) < 6'(A) V ^ C F and ; < a: < u} 

Qil, u, b') is the submodular flow polyhedron. 
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The SF problem is to 

minimize cx 

subject to x € Q{1, u, b'). (5.6.2) 

A host of network flow problems such as the minimum cost flow problem (MCF) [20] , poly- 
matroidal network flow problem [41], the graph orientation problem [21], poly(matroid) in- 
tersection problem [19] ,the independent flow problem [24] and the directed cutting packing 
problem are all special cases of the SF problem. 

5.6,1 Minimum cost-submodular flows 

In this section we concentrate on the min-cost submodular flow problem, which is a special 
case of (5.6.2). 

Minimize z(x) = ^ CtjXij (5.6.3) 

6j)€Ar 

subject to 

Ax(F) <b'(F)VFCV, F^4>. F, b'(F) < oo 
where 0 < x < u 

A feasible solution of (5.6.3) is called a submodular flow. 

A number of algorithms have been proposed for solving (5.6.3). Frank [23] and H. Rock et.el 
[29] have proposed strongly polynomial algorithms which rely on the ellipsoid algorithm. 
Other solution methodologies are negative circuit [58], primal simplex [6] , primal-dual [15] 
and out-of-kilter [26] which are all closely related to the corresponding algorithms for the 
Minimum cost network flow problems. These algorithms require an oracle which obtains 
the minimising set for a submodular function. 

We have adapted the cycle cancelling algorithm of [52] for solving (5.6.3). Like all other 
algorithms for (5.6.3), this algorithm also requires an oracle for minimizing a submodular 
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function, but in terms of its implementation, it is simpler than the other algorithms. We 
make the following assumptions regarding (5.6.3). 

1. Ail data (cost, capacity) are integer valued 

2. the lower bound on arc flows are all zero. 

3. the min-cost submodular polyhedron is nonempty and bounded. 

4. A starting feasible flow x is available. 

In case a starting feasible flow is not available, the algorithm given in [22] can be 
used to obtain one. 

Since it is simpler to deal with fully submodular functions rather than crossing or inter- 
secting submodular functions, it has been shown in [27] that b' can be replaced by a unique 
fully submodular function b on the right hand side of (5.6.3). such that the set of feasi- 
ble solutions remains the same, that is Q(0,U,b') = Q{0 U.b). and the optimal solution 
obtained with respect to Q(0,U,b) can be used for an optimal solution for (5.6.3) without 
any change. Obtaining a starting feasible solution for Q(0,U,b) is treated in [22] 

5.6.2 Cycle cancelling algorithm 

We will from now on refer to (5.6.3) with the right hand side function b' replaced by the 
fully submodular function b. For solving (5.6.3) we modify the cycle cancelling algorithm 
of [52] which is the basic algorithm for the algorithm dealt in the previous section for the 
minimum cost network flow problem. Our algorithm differs from that of Fujishige’s [27] 
in its procedure for detecting negative cycles and is quite easy to implement. It should 
be possible to further improve our algorithm by adapting the Dynamic share procedure 
presented in the previous section, for the min-cost flow problem. 

Definition 5.6.2. For a directed graph G(V,E) a subset B of V is said to be a uv set if u 


is in B but not v. 
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B a subset of V is said to be x-tight if Xx{B) = b{B). 

It has been shown m [27] that union and intersection of x-tight sets are also x-tight. 

For any feasible flow x, form the auxiliary graph G' = (V' E') in the same manner as 
for the MCF in the previous section, with some additional arcs as follows: 

1. For each arc (i,j) in E with flow Xy < Uy introduce an arc (i,j) in G' with capacity 

Uy = Uy — Xy aUd COSt Cy . 

2. For each arc (i,j)in E with flow Xy > 0 introduce an arc (j,i) in G' with capacity 
y-i] = with cost — Cij. 

3. For all {i,j} in V ,if there is no ij tight set then introduce an arc (i.j) in G' with 
capacity Uy = mm {b{F) — A 2 ;(F’)|F’ is ij set) and cost Cjj = 0. This arc is called 
a jumping arc. 

In the lemma given below we show that computing the capacity Uy of a jumping arc 
requires minimizing a submodular function 

Lemma 5.6.1. For x a feasible flow for (5.6.3), and i,j 6 V 

Minimising {b{F) — Xx{F)\ F is ij set) (5.6.4) 

IS equivalent to minimizing a submodular function. 

Proof let V' = V/{i,j) 

Define f{A) = (6 - A^)(A U ^) V A C V' 

f is submodular on the subsets of V , because {h — A^) is submodular on V, and 
min (6(F) - Ae(F)|F is (z;) set) = min {/'(A), A C V'} 


Hence the lemma. 


□ 
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5.6.3 Optimality conditions 

Let B be a (0 ± 1) matrix with rows corresponding to sets X. with b{X) < oo. The dual 
of problem (5.6.3) is 

Maximize —by — au (5.6.5) 

subject to —yB — al <c y, a > 0 

where the variable a corresponds to the arcs in E and y to the subsets of V. I denotes the 
identity matrix of appropriate size. 

Let us introduce the potential vector tt € to the nodes of V and define 
yF{e) = 'K{j) — 7r(z), for all e = (i,j) E E For details see [18]. 

The complementary slackness conditions, for the primal dual pair (5. 6. 3), (5. 6. 5) as in [15] 
are- 

1. - Tr(i) + 7r{j) < 0 =» Xjj = Uy. 

2. Cjj — 7 r(i) + 7r{j) > 0 Xy = 0. 

3. yT > 0 ^ A,(F) = 6(F), 

that is, Vi, j eV %f 7r(y) > 'ir{i) then 
min(b{F) — Ai(F)|F is ij set) = 0. 

thus in order to solve (5.6.3) we need to find a x feasible for (5 6.3) and tt a set of node 
potentials satisfying the complementary slackness conditions (1),(2) and (3). We now 
define the reduced cost optimality conditions and the e-optimality conditions, and it is the 
e - optimality conditions that we will be working with. 

Reduced cost optimality conditions. 

As for as the MCF problem, by defining c^j = + the complementary slackness 

conditions (1),(2) and (3) for the pair x,7r can be rewritten as 
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1- Cy > 0 V arc (z, j) in G'{x) 

2. V z, j € N, TT{j) > Tr{i), then 

min(b{F) — Xx{F) | F ij set) = 0 

Details for combining complementary slackness conditions (1) and (2) into the reduced 
cost optimality condition (1) can be found in [3]. 

e -optimality. 

A flow X is said to be e-optimal for some e > 0, if together with some potential vector tt, 
it satisfies the following e-optimality conditions. 

1. Cy > — e V arc{i,j) in G(x) 

2. \/i,j € V, such that 7r(j) > 7r(i) , 
min{b(F) - Ax(F)|f’ (if) set) = 0 

We now show that with integer costs, an e optimal flow x is optimal if e < 1/n. 

Theorem 5.6.1. For a minimum cost submodular flow problem with integer costs, any e 
optimal feasible flow is optimal if e < \jn. 

Proof. For any directed cycle W in G'{x), we have > —ne > —1. Since the costs 

are integers this implies that there are no negative cycles in G'{x). For details see (Lemma 
10.2) of [3]. So reduced cost optimality condition (1) is satisfied, e-optimality condition (2) 
is the same as the reduced cost optimality condition (2). Hence any e-optimal flow with 
e < 1/n is optimal for (5.6.3) Hence the theorem. CO 

Defiinition 5.6.3. An arc (i,j) in the auxiliary graph G'{x) is called an admissible arc if 
< 0 - 

We call a graph with only admissible arcs as admissible graph. 

We call the cycles in the admissible graphf consisting of admissible arc) as admissible cycles. 



Cycle cancelling algorithm by sharing optimality violation; 

Let X be a feasible flow obtained by using the algorithm of [22]. Construct the auxiliary' 
graph G'{x) = {V,E’). 

Initialy, take the potential tTi = 0 for all nodes i G I ' 

The reduced cost for (i,j) G E' are, 

Let us make the admissible graph acyclic by augmenting the flow along the cycles, present 
in the admissible graph. 

We then choose a most negative eligible arc say (q.p) and find the set of nodes S reachable 
from p through eligible arcs only. 

Let 5 = V - 5. 

Define e as follows. 
ei = -cjp 

62 = min{'K{j) — 7r(z) | for z G S', j G 5 with 

7r(y) > 7r(z), and min (6(F) — Aa;(F) \ F ib ij set) = 0} 

(this will ensure that the e — optimality condition (2) will continue to be satisfied) 


6 = mzn(ei , ea) 

Now for every node i in S update the potential as follows. 

7r(z) = 7r(z) 4- f . 

Note that ttj > tTi after the update. 

The update will make 

(i) the reduced cost of every arc from S to S reduce by | and 

(ii) the reduced cost of every arc from S to S increase by |. 

Updating the potential may create new eligible arcs in (S', S), while keeping atleast the 
arc (q,p) eligible. We try to create new eligible arcs by this process, and due to this 
the admissible graph becomes cyclic. We make it again acyclic by repeatedly finding and 
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cancelling admissible cycles. 

We cancel the admissible cycles by augmenting the flow as follows: 

Let 

Si = min{uij — Xij | (i, j) is of type (1) in the auxiliary graph} 

$2 = min{a:£j j (^, j) is of type (2) in the auxiliary graph} 

Ss = mm{min{b(F) — Xx{F) j F zs if set) for all jumping arcs} 

and 5 = min{(5i, ^2) >^3} 

For finding <^3 we would need an oracle for minimizing a submodular function. 

The modified flow is 

Xtj + 5, (z,j) is of type (1) m the auxiliary graph 

x[j = < Xy — J, (z,j) is of type (2) in the auxiliary graph 

Xtj, (hj) is a jumping arc 

While augmenting flow from i to j, one may come across parallel arcs and the natural 
choice would be to choose the one with maximum capacity. Henceforth, while augmenting 
flow along a cycle we will choose the arc with maximum capacity. 

If the admissible graph is acyclic find the node p such that the most negative arc is an 
incoming arc to node p, and repeat the above process of updating the potentials. creating 
new eligible arcs, augmenting the flow and finding new auxiliary graph until e > 1/n. Stop 
the process when e < 1/n. 

We now prove the validity of our algorithm. 

Theorem 5.6.2. The flow x and the potential vector tt obtained at the end of our algorithm 
satisfy the reduced optimality conditions. 

Proof. It is clear from the algorithm that at each stage the pair (x,7r) maintains e- 
optimality for some e > 0. The algorithm terminates only when e reduces to a value less 
than 1/n.Thus it follows from theorem 5.6.1 that our algorithm terminates at an optimal 
solution X for (5.6.3). 
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We now give a concise version of our algorithm . 

Algorithm 

begin 

TT = 0, X be any feasible flow 

Cancel admissible cycles by augmenting flow until 
the admissible graph become acyclic. 

while (e > 1/n) do 

begin 

(i) Create admissible arcs as explained above, and update the potentials. 

(ii) Cancel admissible cycles. 

(iii) Form auxiliary graph with respect to the new flow x 
end; 

end; 

One can note that after n-consecutive iterations of the algorithm, the reduced cost of the 
most negative arc in G(x) increases, and hence the value of e will decrease. The integrality 
of costs will ensure the finiteness of the algorithm. 

5.6.4 Complexity 

Assume that an oracle for minimizing a submodular function is available to us vuth time 
complexity o(k). 

Since the starting value of e is max {ctj}=C, and after atmost n consecutive iterations, the 
value of the most negative arc reduces to half of its value, after n(iog(nC)) iteration the 
value will be > —1/n > — e. 

The number of augmentation is therefore atmost o(nm), and each augmentation takes 
o(nh) time. Thus the upper bound on the flow augmentation is o{p^vnk). 
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5.7 Concluding Remarks 

From the results based on Chapter-2 and Chapter-3, it is very clear that for the submodular 
and bisubmodular functions the values of / at the empty set is playing a vital role. Again 
the submodular function / having non zero value at the empty set can be handled easily 
compared to a bisubmodular function / having non zero value at the empty set. For the 
bisubmodular function /, if cj)) < 0 solving the LPP over the associated polyhedron 
seems to be a difficult problem. We feel that in this case also it should be possible to 
exploit the polyhedral structure of the constraint set so that one can develop an efficient 
algorithm for the corresponding LPP. 

Our algorithm for solving an LPP over a submodular polyhedron relies heavily on an oracle 
for minimising a submodular function. 

A polynomial time algorithm for minimising a submodular function will speed up our al- 
gorithm and many other algorithms for which it is need as an oracle. 

From the discussions on increasing bisubmodular function in chapter4 we observe that the 
polyhedral structure of an increasing bisubmodular polyhedron is similar to that of a sub- 
modular polyhedron in many ways. 

Minimising a submodular function can be reduced to a membership problem for a poly- 
matroid polyhedron. The question to be asked is that can minimisation of a bisubmodular 
function be also reduced to a membership problem over an increasing bisubmodular poly- 
hedron. 

The cycle cancelling algorithm in chapter-5 for the minimum cost network flow problem 
simultaneously flnds negative cycles and tightens optimality violations by using the basic 
graph search algorithm. The algorithm is very simple to implement and from the compu- 
tational results we obtain, behaviour of our algorithm is very efficient. We also extend this 
algorithm to solve the minimum cost submodular flow problem. 



It should be possible to further improve our algorithm for solving the minimum cost sub- 
modular flow problem by adapting the Dynamic share procedure presented in this chapter, 
for the minimum cost network flow problem. 
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APPENDIX 


Code for our cycle cancelling algorithm gi\-en in chapters 

#include "stdio.h" 

#include "time.h" 

#include <unistd.h> 

#define infty 500000 

#define true 1 

#define false 0 

void max_flow(void) ; 

void max.augument (void) ; 

int inax_exaiii ( int i) ; 

void inin_cost(void) ; 

int mcLx_viol(void) ; 

int p2gr eater (unsigned int i) ; 

int find (void) ; 

void sscan(int p) ; 

int exam (int i) ; 

void ini_data(void) ; 

void augument(int il,int i2, int jl); 

void shortest_path(int s) ; 

void augument_path(int il.int i2) ; 

int tailCSlOOO] ,head[51000] .cost [51000] ,cap[51000] , 

int trace [51000] .rpoint [4000] , crpoint [4000] ; 

int point [4000] .flow [51000] . cpoint [4000] ; 

int a [4000] [250] ; 

int cou[4000] .distance [4000] .perm [4000] ; 
float mctime; 

int potent [51000] .b [4000] ; 
int scal.fact; 

int le_arc ,total_cost. current; 
int nodes .arcs .nsorc .nsink; 
int pred[4000] .link [51000] .label [4000] ; 
int cval , outflow . key . scounter . current _node ; 

void main(void) 

int 1 . j .k.rcost , counter ; 

FILE *fp; 

fp=f open ("data" . "r"); 

fscanf (fp. "7.d\t 7,d\t */,d\t 7.d\n". fenodes.&arcs.&nsorc.&nsink) ; 
for( 1=1 ;i< arcs+1; ++i) 

fscanf (fp."7.d\t */.d\t 7.d\t 7.d\n ", &tail [i] ,&head[i] .fecost [i] ,&cap[i. 

} 

f close (fp) ; 
ini_data() ; 
max_flow() ; 
min_cost 0 ; 
total_cost=0; 
for(j=l; j<arcs+l; ++j) 

{ 

if (flow[j]>0) 

total_cost=total_cost+(flow[ 3 ]*cost[j]) ; 
printf ("total_cost=7.d\n ", total_cost) ; 
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void ini_data(void) 

{ 

int i,j,k; 

for (i=l;i<iiodes+4;++i) 
cou. [i]=l; 

for (j=l; j<arcs+l,- ++j) 

{ 

i=head[j] ; 
a[i] Ccou[i]]=j; 
cou[i] =cou[i]+l; 

> 


sx—j. , 

do 

■c 

while (a [i] Cj] !=0) 

{ 

trace Ck]=a[i] [j] ; 
k=k+l ; 

} 

1 = 1 + 1 ; 

j=i; 

>while(a[i] [j] !=0) ; 
i=nodes+2; 

j=i: 

while(a[i] [j] !=0) 

-C 

trace Ckl=a[i] [j] ; 

j=j+l; 

k=k+l; 

} 

point [1] =1 ; 
rpo^nt [1]=1; 
k=2; 
j=2; 

for( i=i. <arcs+l; ++i) 

if (head [trace fi]] ! =head [trace [i+l]] ) 

{ 

rpoint Lj]=i+1; 

if (tail [i] ! =tail [i+l] ) 

point [kj=i+l; 
k=k+l; 

} 

rpoint [nodes+2]=rpoint[nodes+l] ; 
point [nodes+2] =arcs+l ; 
rpoint [nodes+3] =arcs+l ; 
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} 


This Subroutine will calculate the starting feasible 
flow by solving the max-flow problem using labelling 
Algorithm. 


void maz_f low (void) 

■C 

int i, j ,k, delta, t ; 

for (i=l; i<Cnodes+4) ;++i) 

■C 

cpoint [i] =point [i] ; 
crpoint [i] =rpoint [i] ; 
pred[i] =0; 

for (j=l; j<arcs+l; ++j) 

flowCj]=0; 

linkCj]=0; 

> 

pred[nodes+l]=-l ; 
delta=0 ; 
j=nodes+l; 
do 

-C 

i=j; 

j=max_exam(i) ; 
if (j==nodes+2) 

max_augument () ; 
j=nodes+l ; 

} 

> while (j !=-l) ; 

} 


int max_exam(int i) 

int care; 

int j, links, k; 

while (cpoint [i] <point [i+l] ) 

carc=cpoint [i] ; 
j=headCccLrc] ; 

if ( (flow [care] <cap [care] ) && (predCj] ==0) ) 

predCj]=i: 
link [j]=carc; 
return(j) ; 


} 

else 

cpoint [i] =cpoint [i] +1 ; 

} 


while (crpoint [i] <rpoint [i+l] ) 

-C 
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carc=trace [crpoint [i] ] ; 
l^r! ^ “p r! jI * 

if ((flow [care] >0)&&(predEk]==0)) 

{ 

pred [k] =i ; 
link [k]=c arc; 
return (k) ; 

> 

else crpoant[i]=crpoint[i]+l; 

> 

j=pred[i] ; 
pred [i] =0 ; 
links=link[i] ; 
if (tail [link [i]]==j) 

++cpoint [j] ; 
else ++crpoint [j] ; 
return j ; 


void max_augument (void) 

■C 

int delta, temp; 
int p,k,i; 
delta=infty; 
p=nodes+2; 
while (p ! =nodes+l) 

k=link[p] ; 
if (head[k]==p) 

if (delta>cap[k] -flow[k] ) delta=cap[k]-flow[k] ; 

} 

else 

if (delta>flow[k]) delta =f low [k] ; 

} 

p=pred [p] ; 

} 

p=nodes+2; 
while (p ! =nodes+l) 

k=link[p] ; 
if (head[k]==p) 

flow[k] +=delta; 

} 

else 

flow [k] -=delta; 

epoint [p] =point [p] ; 
crpoint [p]=rpoint [p] ; 
temp=pred[p] ; 
pred[p] =0; 
p=temp; 

} 
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for (p=l ;p<nodes+3;++p) 

{ 

cpoint [p] =point [p ] ; 
crpoint [p] =rpoint [p] ; 

> 

pred [nodes+1] =-l ; 


Subroutine for the minimum-cost flow problem using 
our new cycle cancelling algorithm disscused in 
chapters 


void min_cost (void) 

-C 

int d,j ,i,max,k,rcost,s; 
mcLX=max_viol() ; 
scal_f act=p2greater(max) ; 
scal_f act=scal_f act/2 ; 
le_arc=0; 

while (scal_f act>=l) 

do 

{ 

i=find() ; 

if (i != nodes+4) 

■C 

pred[i]=-l; 
sscan(i) ; 

> 

}while(i '=nodes+4); 
le_arc=0; 

scal_f act=scal_f act/2 ; 

> 

f or(i=l ; i<nodes+3;++i) 
b[i]=0; 

> 

for(j=l; j<arcs+l;++j) 

rcost=cost [j] -potent [tailEj] ]+potent [headCj]] ; 
if ((flowCj]<cap[j])&&(rcost==-l) ) 

b [tail [j] ] =b [tail [j] ]- (cap [j] -flow [j] ) ; 
b [head [j ] ] =b [head C j ] ] + ( cap [ j ] -flow [j ] ) ; 
flow[j]=cap[j] ; 

if ((flow[j]>0)&fe(rcost==l)) 

b[tail[j]]=b[tail[j]]+flowCj] ; 
b [head [j]]=b[head[j]] -flow [j] ; 
flow[j]=0; 

} 

current _node=l ; 
do 

■C 
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if (b [current _node] >0) 

sliortest_path (current _node) ; 
if (b [current _node] >0) 

current _node=current_node-l ; 

++current_node ; 

y while (current _node ! =nodes+3) ; 


void shortest_path(int s) 

int i , j , dc , smalldis ; 
int Ccirc; 
int newdis,k; 
f or (i=0 ; i<nodes+3 ; ++i) 

-C 


distance [i] =inf ty ; 
penii[i]=0 ; 


} 


pred[i] =0; 
link[i] =0 ; 
label [i] =0 ; 
cpoint [i] =point [i] ; 
crpoint [i] =rpoint [i] ; 


scounter=2; 
label [l]=s; 
perm[s]=l; 
distance [s]=0; 
current =s ; 
do 

smalldis=infty; 

dc=distance [current] ; 

while (cpoint [current] < point [current+1] ) 

carc=cpoint [current] ; 

j=head[carc] ; 

if ( (perm [j] ==0) &&(f low [care] <cap [care] ) ) 
if (pred[j]==0) 

label [scounter] =j ; 

++S counter; 

newdis=dc+(cost [care] -potent [current] +potent [j! 
if (newdis<distance[j]) 

distance [j] =newdis ; 

Pred[j]=current ; 

link[j]=carc ; 

} 


} 


++cpoint [current] ; 


while (crpoint [current] < rpo int [current+1] ) 
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{ 

carc=trace[crpoint[current]] ; 
j=tail [care ] ; 

if ( (perm [j] ==0) &&Cf low [care] >0) ) 

■[ 

if Cpred[j]==0) 

label [scounter]=j ; 

++S counter; 

> 

newdis=dc-(cost [care] -potent [j] +potent [curren- 
if (newdis<distance [j] ) 

distance [j] =newdis ; 
pred [j ] =current ; 
link[j]=carc; 

> 

} 

++crpoint [current] ; 

} 

for(i=l;i<scounter+l; ++i) 

■C 

if (perm [label [i] ] ==0) 

if (smalldis>distance [label [i] ] ) 

smalldis=distance [label [i] ] ; 
current=label [i] ; 

> 

} 

} 

perm [current] =1 ; 

}while (b [current] >=0) ; 
augument_patli( current, s) ; 

} 

void augument_pat]i(int il,int i2) 

int i,j,p, delta; 
delta=infty; 
p=il; 
do 

-C 

j=link[p]; 
if (liead[j]==p) 

if (delta> (cap [j] -f low [j] ) ) 

delta=cap[j]-flow[j] ; 

} 

else if (tail[j]==p) 

if (delta>flow[j]) delta=flow[j] ; 

} 

p=pred Cp] ; 

]• while (p ' =i2) ; 

if (delta>b [i2] ) delta=b[i2] ; 
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if (delta>-b [il] ) delta=-b Cil] ; 

p=il; 

do 

■C 

j=linkCp] ; 
if (liead[j]==p) 

flow[j]=flow[j]+delta; 
else flow [j]=f low Cj]-d.elta; 
p=pred [p] ; 

)■ while (p ! =i2) ; 
b [i2] =b [i2] -delta ; 
b [il] =b [il] +delta; 
for(i=l;i<scotLater+l; ++i) 

j=label[i] ; 
if (perm [j] '=0) 

potent [j] =pote]it [j] -distance [j] +distance [il] ; 

} 

> 

I > 

111 int max_viol(void) 

} < . . 

int J ; 

int maic; 
max=0 ; 

for(j=l; j<arcs+l;j++) 

if ( (flow [j] <cap [j] )&& (cost [j] <0) && (max<-cost [j] ) ) 
max=-cost [j] ; 

else 

f if ( (flow [j] >0) && (cost [j] >0) && (max<cost [j] ) ) 

max=+cost [j] ; 


k return mar ; 

^ ' 


trmt p2greater( unsigned int i) 

int k,kl; 
k=0; 
kl=0; 
while (i) 

i=i»l; 

++k; 

kl=l«k; 
return kl; 



find (void) 

int j.rcost; 
j=le_arc; 
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while (j<arcs+l) 

-C 

j=j+i; 

rcost=cost [j] -potent [tail [j] ] +potent [head [j] ] ; 
if C(flow[j]<cap[j])&&C rcost< -scal_fact)) 

le_arc=j ; 
return head[j ] ; 

> 

else if ((flow[j]>0)&& (rcost>scal_fact) ) 

-C 

le_arc=j ; 
return tail[j] ; 

> 

} 

return nodes+4; 

} 

void sscanCint p) 

■[ 

int j; 

J=p: 

do 

j=exam(j) ; 

} while (j != -1); 

} 

int examCint i) 

int k,j,caxc; 

while (cpoint [i] <point [i+1] ) 

carc=cpoint [i] ; 

j=head[caxc] ; 

if( flow [care] <cap [care]) 

if ( (cost [care] -potent [i] +potent [j] ) < 0) 

if( pred[j] !=0) 

-C 

augumentd, j ,carc) ; 

% return j ; 

} 

else 

pred[j]=i; 
link [j]=carc; 
return j ; 

} 

^ } 

cpoint [i] =cpoint [i] +1 ; 
while (crpoint [i] <rpoint [i+l] ) 
carc=trace[crpoint[i]] ; 



} 


if (flow [care] >0) 

-C 

k=tail [care] ; 

if ( (cost [care] -potent [k] +potent [i] ) 


if (pred [k] ! = 0) 

i 

augument (i,k,carc) ; 
return k; 

> 

else 


} 


} 


pred [k] =i ; 
link [k] =carc ; 
return k; 


} 

crpoint [i] =crpoint [i] +1 ; 

} 

potent [i] =potent [i] +scal_f act ; 
epoint [i] =point [i] ; 
crpoint [i] =rpoint [i] ; 

3 =pred[i] ; 
pred[i]=0; 
return j ; 


> 0 ) 


void augument(int il, int i2, int jl) 

{ 

int p, delta, j , temp; 
p=il; 

delta=infty; 

do 

{ 

j=link[p] ; 
if (head[j]==p) 

if (delta> (cap [j] -flow [j])) 

delta=cap[j]-flow[j] ; 

else if (tail[j]==p) 

li (delta>flow[j]) delta=flow[j] ; 

> 

p=pred[p] ; 

}while (p !=i2); 

if (tail[jl]==il) 

if ((cap[jl]-flow[jl])<delta) delta=cap[jl]-flow[ji] ; 
flow [j l]+=delta; 

> 

else 

if (flow[jl]<delta) 

delta=flow[jl] ; 



flow[jl]-=delta; 


> 

p=il; 
do 

j=liiikCp] ; 

if (liead[j]==p) flowCj]+=delta; 

else flow[j]-=delta; 

temp=p ; 

p=pred[p] ; 

pred [temp] =0 ; 

>wliile (p ! = i2) ; 



